The inaugural issue of the Journal of the Audio Engineering
Society is expected to become a collector's item.

Copies of this first issue are still available: Price to members,
$2.00 each. Price to non-members,* $4.00 each.

All other back issues (April, July, and October, 1953; January,
April, July, and October, 1954; January, April, July, and October,
1955) are $1.00 each to members and $2.00 each to non-members.
Please address orders to the
Audio Engineering Society
P. O. Box 12
Old Chelsea Station
New York 11, N. Y.

* A membership application blank is included at the rear of the present issue of the Journal.
Synthesis of Speech By Electrical Analog Devices*


KENNETH N. STEVENS 


Acoustics Laboratory, Massachusetts Institute of Technology, 
Cambridge, Massachusetts 


Two electrical devices for simulating the operation of the speech mechanism are described. One
synthesizer is a variable transmission-line analog of the vocal tract. The other synthesizer incorporates
variable simple-tuned circuits to represent vocal-tract resonances and can be made to generate
simple sentences under the control of a teletype tape and relays, or an analyzer that extracts
appropriate control signals from human speech.


ONE APPROACH to the electrical synthesis of speech
is to attempt to simulate, to a greater or lesser extent,
the mechanism that we ourselves utilize when we generate
speech. It is profitable, therefore, to study and try to understand
the human vocal mechanism before we try to build
analogous electric circuits. Let us examine the speech mechanism
briefly before we go on to describe the principles of
operation of our electrical speech synthesizers.

In Fig. 1 we show a cross-section through the head during 
the production of a particular speech sound |i|.¹ The vocal
tract is an acoustic tube terminated by the vocal folds at 
one end and by the mouth opening at the other. Such a 


* Presented at the Seventh Annual Convention of the Audio Engi- 
neering Society, New York, October 12-15, 1955. This work was sup- 
ported by the U. S. Air Force under Contract No. AF 19(604-626). 

1 T. Chiba and M. Kajiyama, The Vowel, Its Nature and Structure,
Tokyo-Kaiseikan Publishing Company, Ltd., Tokyo (1941).


representation of the vocal tract can be visualized more 
clearly in the bottom portion of Fig. 1. Here the vocal tract 
is straightened out, and the radius of a cylindrical tube of 
equal cross-sectional area is plotted as a function of distance 
from the glottis. When we utter a vowel, we “turn on” the 
vocal cords at one end of the tube, we shape the walls of 
the tube appropriately, and sound radiates from the mouth 
opening. Since the dimensions of the tube are comparable 
to a wavelength in the frequency range of interest, the tube 
acts like a transmission line or organ pipe. It has certain 
normal modes of vibration or resonances. When excited by 
a source rich in harmonics, certain harmonics of the source, 
i.e., those in the vicinity of the vocal-tract resonances, are
accentuated, while others are attenuated. When we shape 
our vocal tract in different ways we change the resonance 
frequencies. We seem to listen for these changes when we 
perceive spoken vowels and the transitions between vowels. 
Fig. 1. The upper part of the figure shows a section through the
midline of the vocal tract during the production of the vowel |i|
(meat). The lower part of the figure shows the radius of a cylindrical
tube of cross-sectional area equal to that of the vocal tract,
plotted as a function of distance from the glottis. (From Chiba
and Kajiyama¹).


The formants discussed by Mr. Borst² are really the acoustic
manifestations of these vocal-tract resonances. 
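The organ-pipe behavior described above can be illustrated numerically. The following Python sketch treats the vocal tract as a uniform tube closed at the glottis and open at the mouth, for which the resonances fall at odd multiples of the quarter-wavelength frequency. The tube length of 17.5 cm and the sound velocity of 35,000 cm/sec are assumed round values chosen for illustration, and the function name is hypothetical. A real vocal tract is not uniform, so its resonances depart from these figures.

```python
# Resonances of a uniform tube closed at one end (glottis) and open at the
# other (mouth): f_n = (2n - 1) * c / (4 * L).

SPEED_OF_SOUND_CM_PER_S = 35000.0  # assumed value, not taken from the paper
TRACT_LENGTH_CM = 17.5             # assumed vocal-tract length


def uniform_tube_resonances(length_cm, count=3, c=SPEED_OF_SOUND_CM_PER_S):
    """Return the first `count` resonance frequencies in cps."""
    return [(2 * n - 1) * c / (4.0 * length_cm) for n in range(1, count + 1)]


if __name__ == "__main__":
    for n, f in enumerate(uniform_tube_resonances(TRACT_LENGTH_CM), start=1):
        print(f"resonance {n}: {f:.0f} cps")  # 500, 1500, 2500 cps
```

Shaping the tube differently moves these resonances, which is the effect the listener is presumed to follow in vowels.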

When we produce certain consonants, we constrict the 
vocal tract at one point, and create turbulent air flow at 
the constriction. The noise so generated may then be modi- 
fied by vocal-tract resonances, and appear at the output as 
|f| or |s|, for example. Other consonants may be generated
by vibrating the vocal cords in conjunction with turbulence 
noise (as in |v| or |z|), by constricting the vocal tract com- 
pletely and then suddenly releasing the constriction (as in 
|p| or |t|), or by coupling to the nasal cavity. 

Mr. Dudley³ has indicated how this view of speech production
has certain analogies to carrier telephony. We
“modulate” certain sound sources by the relatively slow 
movements of the vocal tract. In a sense we are viewing 
the speech mechanism in the form shown in Fig. 2. This 
figure shows, in a very schematized and general form, how 
the vocal mechanism operates, and suggests, therefore, how 
we should proceed in the development of an electrical syn- 
thesizer that simulates this operation. Sources, whether they 


2 John M. Borst, “Use of Spectrograms for Speech Analysis and
Synthesis.” This paper appears in the present issue of the Journal.
See Table of Contents.

3 Homer Dudley, “Fundamentals of Speech Synthesis,” J. Audio
Eng. Soc., 3, 170-185 (October 1955). See also, H. Dudley, “The
Carrier Nature of Speech,” Bell System Technical J., 19, 495-515
(October 1940).


be noise or “buzz,” excite variable linear acoustic circuits
that represent the vocal tract, at least from the point of 
view of their input-output relations. In natural speech we 
control the motions of our vocal tract and vary the excita- 
tion in proper time sequence to obtain an intelligible out- 
put. In the synthesizer we should like to approximate the 
same process. 

This view of the speech-generating process is applicable 
to other sound-generating devices. In a musical instrument 
like a violin, a bow is drawn across a string to generate a 
continuous source of excitation; the strings of variable length 
form the variable circuits. Dr. Olson⁴ elaborates on this
theme in his paper on synthesis of music. 

At MIT we have built two electrical synthesizers of 
speech that operate on these general principles. The first 
is an electrical transmission-line analog of the vocal tract, 
and is similar in some respects to Dunn’s electrical vocal
tract,⁵ discussed by Mr. Dudley. The other is a “resonance”
type of synthesizer,⁶ not unlike the “Octopus” synthesizer⁷
described by Mr. Borst. Synthesizers of both the analog
type and the resonance type have been built in many other
laboratories.⁸,⁹,¹⁰


[Diagram labels: SOURCES, VARIABLE CIRCUITS, CONTROL SIGNALS, SPEECH OUTPUT.]

Fig. 2. Block diagram depicting the speech mechanism as variable
acoustic circuits excited by appropriate sources.


4 H. F. Olson, “Electronic Music Synthesizer.” Presented at the
Seventh Annual Convention of the Audio Engineering Society, New
York, October 12-15, 1955. See also, H. F. Olson, “Electronic Music
Synthesizer,” J. Acoust. Soc. Amer., 27, 595-612 (May 1955).

5 H. K. Dunn, “The Calculation of Vowel Resonances, and an Electrical
Vocal Tract,” J. Acoust. Soc. Amer., 22, 740-753 (November
1950).

6 K. N. Stevens, R. P. Bastide, and C. P. Smith, “Electrical Synthesizer
of Continuous Speech,” J. Acoust. Soc. Amer., 27, 207 (January
1955).

7 W. W. Meeks, J. M. Borst, and F. S. Cooper, “Syllable Synthesizer
for Research on Speech,” J. Acoust. Soc. Amer., 26, 952 (January
1954).

8 C. Gunnar M. Fant, “Speech Communication Research,” IVA, 24,
331-337 (1953).

9 W. Lawrence, “The Synthesis of Speech from Signals which Have
a Low Information Rate,” in Communication Theory, W. Jackson,
ed., Chapter 34, Butterworth Scientific Publications, London, 1953.

10 E. S. Weibel, “Vowel Synthesis by Means of Resonant Circuits,”
J. Acoust. Soc. Amer., 27, 858-865 (September 1955).
ELECTRICAL ANALOG OF THE VOCAL TRACT¹¹


In the transmission-line analog we view the vocal tract 
as an acoustic transmission line of variable dimensions. This 
distributed acoustic line has, of course, an electrical analog. 
The distributed line can be approximated by lumped LC 
sections, if we use enough of them. In the construction of 
our analog synthesizer we have divided the total length of 
the vocal tract into 35 sections, each of which represents a 
½-cm length of the tract. Each acoustic section is approximated
by an electrical π-section as shown in Fig. 3. In our


[Diagram labels: VOLUME VELOCITY, PRESSURE; CURRENT, VOLTAGE; AREA, VELOCITY OF SOUND.]

Fig. 3. A circular cylinder and its electrical analog. Analogous
acoustical and electrical quantities are shown.


version of the analog each of these ½-cm sections is variable
in cross-sectional area, i.e., we can vary the value of $\sqrt{C/L}$
as shown in the figure. The range of variation is from zero 
to about 17 sq cm, the largest area we can produce in the 
vocal tract, in general. Volume velocity is analogous to 
electric current and sound pressure is analogous to voltage. 
Thus we have, in effect, 35 “control” variables that control 
the sizes of the 35 sections. In the present form of the 
analog these variables are changed by means of switches 
that are operated by hand. 
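To make the lumped approximation concrete, the following Python sketch cascades 35 lossless π-sections and locates the frequencies at which the volume velocity at the mouth peaks for a current (volume-velocity) source at the glottis. It is an illustration under stated assumptions, not a description of the actual hardware: the air density and sound velocity are assumed values, the mouth is idealized as a short circuit (zero pressure), losses and radiation are ignored, and all function names are hypothetical. Each section uses the standard acoustic inertance ρl/A and compliance Al/ρc², so that √(C/L) is proportional to the area A.

```python
import math

RHO = 1.14e-3        # air density in g/cm^3 (assumed)
C_SOUND = 35000.0    # velocity of sound in cm/sec (assumed)
SECTION_CM = 0.5     # length of tract represented by each section


def section_elements(area_cm2, length_cm=SECTION_CM):
    """Acoustic inertance L and compliance C of one short tube section."""
    inertance = RHO * length_cm / area_cm2
    compliance = area_cm2 * length_cm / (RHO * C_SOUND ** 2)
    return inertance, compliance


def chain_d(areas_cm2, freq):
    """D element of the cascaded chain matrix (real for a lossless line).

    With zero pressure at the mouth, U_mouth / U_glottis = 1 / D.
    """
    w = 2.0 * math.pi * freq
    a, b, c, d = 1.0, 0.0, 0.0, 1.0   # running matrix [[a, j*b], [j*c, d]]
    for area in areas_cm2:
        ind, comp = section_elements(area)
        x = w * ind          # series reactance
        y = w * comp / 2.0   # susceptance of each shunt half-capacitor
        sa, sb, sc, sd = 1 - x * y, x, y * (2 - x * y), 1 - x * y
        a, b, c, d = (a * sa - b * sc, a * sb + b * sd,
                      c * sa + d * sc, d * sd - c * sb)
    return d


def resonances(areas_cm2, f_max=4000.0, step=10.0):
    """Frequencies (cps) at which D changes sign, refined by bisection."""
    found = []
    f_lo, d_lo = step, chain_d(areas_cm2, step)
    f = 2 * step
    while f <= f_max:
        d_hi = chain_d(areas_cm2, f)
        if d_lo * d_hi < 0:
            lo, hi, dl = f_lo, f, d_lo
            for _ in range(40):
                mid = 0.5 * (lo + hi)
                dm = chain_d(areas_cm2, mid)
                if dl * dm <= 0:
                    hi = mid
                else:
                    lo, dl = mid, dm
            found.append(0.5 * (lo + hi))
        f_lo, d_lo = f, d_hi
        f += step
    return found


if __name__ == "__main__":
    uniform = [6.0] * 35   # every "switch" set to the same area, in sq cm
    print([round(f) for f in resonances(uniform)])
```

For a uniform setting the resonances fall close to the quarter-wavelength values of a 17.5-cm tube, which suggests that 35 sections are ample over the frequency range of interest. Changing individual entries of the area list plays the role of resetting individual switches.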

To the electrical transmission line we can apply noise or 
buzz excitation as required. The output of the vocal cords 
is represented by a current source, looking like a sawtooth 
wave, applied to one end of the line. We can, if we wish, 
vary the frequency and amplitude of this source. If desired, 
an electrical noise source can be applied at any point of 


11 The electrical analog of the vocal tract is described in more detail 
in K. N. Stevens, S. Kasowski, and C. G. M. Fant, “An Electrical 
Analog of the Vocal Tract,” J. Acoust. Soc. Amer., 25, 734-742 (July 
1953). 


constriction along the line, to simulate the noise of turbu- 
lence. 

To set up the synthesizer to produce a given sound we 
have to set 35 switches in the correct positions. The correct 
positions are determined in part from X-ray pictures of the 
human vocal tract when it produces the particular sound. 
Figure 1 is an example of a tracing made from such an X-ray 
picture. From these X-ray pictures we determine the cross- 
sectional area at each %2-cm point along the vocal tract. 
Following adjustment of the 35 switches, the electrical vocal 
tract is excited by an appropriate buzz or noise source, and 
easily identifiable speech sounds are heard when the output 
is transduced by a loudspeaker.¹²

It is obviously a tedious process to adjust 35 switches 
each time the electrical vocal tract is to synthesize a differ- 
ent sound. We know, however, that the switch settings, i.e., 
the various cross-sectional areas, are not independent. There
must be a smooth change of cross-sectional area from one 
point to the next as we move along the vocal tract. Perhaps, 
then, if we make use of such constraints, we can specify the 
vocal-tract shape with a much smaller number of variables 
than 35. A specification of speech articulations in relatively 
simple terms would be of great utility in the development 
of a control system for a dynamic analog of the vocal tract, 
in which the electrical elements would be controlled dy- 
namically instead of by hand switches. 

From studies of X-ray pictures it appears as though we 
might be able to specify the vocal-tract shape with reasonable
accuracy, using just three variables.¹³ These three variables
are illustrated by the idealized vocal-tract shapes shown
in Fig. 4. Again we are considering the vocal tract as an 
acoustic tube, but we are limiting the range of shapes that we 
allow for the tube, because we know that certain physical 
constraints are imposed by the structures that form the 
human vocal tract. Each shape depicted in Fig. 4 is defined 
by three numbers: 

1. The size of the mouth opening, given by $A/l$, which
is the ratio of the cross-sectional area to the length of the
front portion of the tube.

2. The position of the point of greatest constriction of the
tongue, given by $d_0$, the distance from the glottis to this
point.

3. The degree of constriction, given by $r_0$, the effective
radius at the point of greatest constriction.

Given the values of $d_0$ and $r_0$ we then specify a smooth


12 A. S. House and K. N. Stevens, “Auditory Testing of a Simplified 
Description of Vowel Articulation,” J. Acoust. Soc. Amer., 27, 882- 
887 (September 1955). 


13 The scheme for describing vowel articulation in terms of three 
parameters is presented in more detail in K. N. Stevens and A. S. 
House, “The Development of a Quantitative Description of Vowel 
Articulation,” J. Acoust. Soc. Amer., 27, 484-493 (May 1955). 
Fig. 4. Idealized vocal tract configurations for typical values of
parameters $r_0$, $d_0$ and $A/l$; the horizontal axis is distance from
the glottis in cm. (From Stevens and House.)


parabolic outline for most of the tube. Figures 4a and 4d
show the idealized articulatory shapes for two different values
of $d_0$ with the same constriction size $r_0$. When $r_0$ is as
large as 1.2 cm, the cross-sectional area of the tube is essentially
constant (except at the mouth opening), i.e., there is
no constriction, as in Fig. 4c.
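The reduction from 35 switch settings to a few numbers can be sketched as follows. The Python fragment below generates 35 cross-sectional areas from $d_0$ and $r_0$ by means of a parabola in the radius, limited to a maximum radius of 1.2 cm. The curvature coefficient is a hypothetical value chosen only so that the outline looks reasonable; the text above does not state the coefficient, and the scheme of reference 13 should be consulted for the actual rule. The mouth-opening parameter $A/l$ is omitted here, and the function names are illustrative.

```python
import math

MAX_RADIUS_CM = 1.2   # from the text: no constriction when r0 reaches 1.2 cm
CURVATURE = 0.25      # hypothetical parabola coefficient, per cm^2
N_SECTIONS = 35
SECTION_CM = 0.5


def tract_radius(x_cm, d0_cm, r0_cm):
    """Radius at distance x from the glottis for constriction (d0, r0)."""
    r = r0_cm + CURVATURE * (MAX_RADIUS_CM - r0_cm) * (x_cm - d0_cm) ** 2
    return min(r, MAX_RADIUS_CM)


def section_areas(d0_cm, r0_cm):
    """Cross-sectional area (sq cm) at the midpoint of each 1/2-cm section."""
    areas = []
    for i in range(N_SECTIONS):
        x = (i + 0.5) * SECTION_CM
        areas.append(math.pi * tract_radius(x, d0_cm, r0_cm) ** 2)
    return areas


if __name__ == "__main__":
    areas = section_areas(d0_cm=12.0, r0_cm=0.4)
    print(" ".join(f"{a:.2f}" for a in areas))
```

With $r_0$ set to 1.2 cm the sketch returns a constant area, in agreement with the unconstricted case described above.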

It is of interest now to establish relations between these 
three numbers that describe speech articulations and the 
speech sounds produced by these articulations. Such a re- 
lation is shown in Fig. 5. For purposes of illustration we 
are keeping the constriction size ($r_0$) constant, and we are
varying the size of the mouth opening and the position of
the constriction. On the vertical axis of Fig. 5 we plot the
size of the mouth opening, from a long, narrow opening
($A/l$ = 0.1 cm) to a large, short opening ($A/l$ = 17 cm).
The horizontal axis gives the distance from the glottis to
the point of greatest constriction. On the graph sheet we
plot contours that indicate the range of mouth openings and
constriction positions that give rise to the American English
vowels designated within the contours. We observe, for
example, that when the constriction is well to the front of
the vocal tract, the vowel |i| results for a wide range of
mouth openings. When the constriction is well to the back
and the mouth opening is small, the vowel |u| is produced.
Other vowels occupy intermediate positions. Such observations
are in accordance with our experience and agree with
previous studies of linguists and phoneticians.¹⁴ Contours


14 G. E. Peterson and H. L. Barney, “Control Methods Used in a
Study of the Vowels,” J. Acoust. Soc. Amer., 24, 175-184 (March 
1952). 




of the type shown in Fig. 5 have been extended to include 
consonants as well as vowel regions. 


RESONANCE TYPE OF SPEECH SYNTHESIZER 


We have observed that the vowels and some other sounds 
in speech are characterized by vocal-tract resonances or 
formants that change in frequency during continuous speech. 
The acoustic resonances or formants can be simulated elec- 
trically by variable simple-tuned circuits. Excitation of 
these circuits by a quasi-periodic sequence of impulses should 
yield vowel sounds at the output. 

Fig. 6 shows a block diagram of the vowel-generating por- 
tion of our resonance-type synthesizer. The vowels and 
vowel-like sounds are generated in the synthesizer by excita- 
tion of four cascaded tuned circuits by impulses, as shown 
in the figure. In three of the tuned circuits the capacitance 
elements are reactance tubes and are controlled by voltages. 
The fourth tuned circuit is fixed, with a resonant frequency 
of about 3500 cps. As the three control voltages are adjusted
to yield the proper formant frequencies, different
vowels are produced.

Fig. 5. Contours of vowel articulation. The contours delineate
the ranges of articulatory positions that produce the vowel values
shown. The degree of constriction $r_0$ is constant at 0.4 cm. The
vowel symbols are those of the International Phonetic Association.
(After Stevens and House.)
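The effect of cascading four tuned circuits can be illustrated with a short calculation. The Python sketch below multiplies the magnitude responses of four second-order resonators and reports where the resulting spectrum envelope peaks. The first three formant frequencies are illustrative values of the order commonly quoted for the vowel |i| spoken by adult males; they and the bandwidths are assumptions, not settings taken from the synthesizer. Only the fourth resonance, 3500 cps, comes from the text.

```python
import math


def resonator_gain(f, f0, bandwidth):
    """Magnitude response of a simple tuned circuit with unity gain at dc."""
    return f0 ** 2 / math.hypot(f0 ** 2 - f ** 2, bandwidth * f)


def cascade_gain_db(f, formants, bandwidths):
    """Gain in db of the tuned circuits connected in cascade."""
    gain = 1.0
    for f0, bw in zip(formants, bandwidths):
        gain *= resonator_gain(f, f0, bw)
    return 20.0 * math.log10(gain)


def spectral_peaks(formants, bandwidths, f_max=4500, step=5):
    """Frequencies (cps) of the local maxima of the cascade response."""
    freqs = list(range(step, f_max + 1, step))
    env = [cascade_gain_db(f, formants, bandwidths) for f in freqs]
    return [freqs[i] for i in range(1, len(env) - 1)
            if env[i - 1] < env[i] >= env[i + 1]]


if __name__ == "__main__":
    formants = (270.0, 2290.0, 3010.0, 3500.0)   # F1-F3 assumed, F4 fixed
    bandwidths = (60.0, 90.0, 120.0, 150.0)      # assumed, in cps
    print(spectral_peaks(formants, bandwidths))
```

Changing the first three entries of the formant tuple corresponds to changing the three control voltages; the envelope peaks follow the new settings closely, since each resonator dominates the response near its own frequency.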

To generate consonant sounds, additional control signals 
modulate a noise source that may be filtered in different 
ways to produce different consonants. These are generated 
in a channel that is separate from the vowel channel. 

Let us look at the block diagram for the entire synthesizer, 


Fig. 6. Schematized diagram of the vowel-generating portion of
resonance type of synthesizer. The four resonant circuits represent
the first four vowel formants, labelled F1 through F4.


shown in Fig. 7. Across the top of this figure is the vowel 
channel we have already described, with the buzz generator 
at the left, passing through a modulator and then through 
the reactance-tube circuits. Across the bottom is the un- 
voiced channel, consisting of a noise generator, modulator, 
and filters. A teletype reader or a keyboard operates a cen- 
tral relay control circuit which controls the pitch, turns the 
buzz on and off smoothly when required, sets the formant
frequencies for the vowels and consonants, turns the noise 
generator on and off smoothly when required, and selects 
appropriate filters for the unvoiced channel, depending upon 
the consonant to be generated. The relay control circuit 
feeds discrete data to the synthesizer, but smoothing circuits 
are included to provide smooth formant transitions and 
smooth modulation of the sources. The synthesizer could 
also be adapted for operation from continuous control sig- 
nals. 

The teletype reader feeds data to the synthesizer at about 
the same rate that we generate sequences of speech sounds 
when we talk. Data are read from the teletype tape peri- 
odically about 6 times per second. One 5-channel tele- 
type reader can be used to select any one of 32 different 
sounds, a distinctive code being used for each phoneme. At 
present only about 20 different sounds can be generated: 
10 vowels and 10 consonants. Two channels on a second 
teletype reader are used to provide information concerning 
the vocal frequency. This reader is set up to provide three 
possible values of fundamental frequency, and smoothing 
circuits are used to ensure smooth changes of the frequency 
from one value to another. 

The rate at which information is fed into the synthesizer 
by the 2 teletype readers is about 40 bits per second. This 
is approximately 1/1000th of the channel capacity normally 
utilized in a speech communication link, and between 1/50th
and 1/100th of the vocoder¹⁵ channel capacity.
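The figure of about 40 bits per second follows directly from the tape format. The short Python calculation below reproduces it from the numbers quoted above: five channels for the phoneme code, two channels for the fundamental frequency, and about six readings per second. The channel capacities printed for the speech link and the vocoder are merely those implied by the stated ratios, not independently quoted values, and the function name is illustrative.

```python
READS_PER_SECOND = 6     # tape read rate quoted in the text
PHONEME_CHANNELS = 5     # first teletype reader
PITCH_CHANNELS = 2       # channels used on the second reader


def control_bit_rate(reads=READS_PER_SECOND,
                     phoneme_bits=PHONEME_CHANNELS,
                     pitch_bits=PITCH_CHANNELS):
    """Bits per second delivered to the synthesizer by both readers."""
    return reads * (phoneme_bits + pitch_bits)


if __name__ == "__main__":
    rate = control_bit_rate()
    print("distinct phoneme codes:", 2 ** PHONEME_CHANNELS)   # 32
    print("pitch codes available:", 2 ** PITCH_CHANNELS)      # 4, 3 used
    print("control rate:", rate, "bits per second")           # 42
    print("implied speech-link capacity:", 1000 * rate, "bits per second")
    print("implied vocoder capacity:", 50 * rate, "to", 100 * rate,
          "bits per second")
```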

To examine in more detail the operation of the synthesizer 
let us look at Fig. 8. This demonstrates the operations per- 
formed by the synthesizer in generating the simple word 
sat. At the top of Fig. 8 we show the symbols coded on 
the teletype tape. The symbol at the left is for the |s|, the 
next two symbols are for the |æ|, and the final symbol is for
the |t|. Below the sketch of the teletype tape we show a
diagram that indicates how the formant frequencies¹⁶ move
with time as the symbols on the tape give instructions to 
the synthesizer. Time is plotted horizontally and frequency 
vertically. The vertical lines spaced periodically across the 
diagram indicate the times at which the discrete instructions 
are read from the teletype tape. 

The first instruction tells the synthesizer to say |s|. The 
formant frequencies move to positions indicated by the 
dashed lines at the left of the figure. However, the reactance 
circuits are not excited by the buzz generator for this un- 
voiced fricative sound, and for this reason the formant fre- 
quencies are shown by dashed lines instead of by solid lines. 
When the |æ| code is read on the teletype tape, the frequencies
of the three formants are told to move to new positions.
Since smoothing circuits are incorporated in the control sys- 
tem, the formant frequencies move to their new positions 
Fig. 7. Block diagram of the entire resonance synthesizer, showing
the vowel channel across the top, the channel for unvoiced
consonants at the bottom, and the relay control circuit.


with some delay as indicated on the figure. While the form- 
ant frequencies are in motion toward their new positions, 
the buzz generator is turned on at the point where the 
dashed lines become solid lines. Thus the reactance tubes 


15 H. Dudley, “The Vocoder,” Bell Lab. Rec., 18, 122-126 (December
1939).


16 In this paper the word “formant” is used to indicate a natural
frequency of the vocal tract. For vowel sounds, the manifestation of 
a formant in the sound wave is a concentration of energy in a particu- 
lar frequency range; for consonant sounds there may be no acoustic 
manifestation of some formants, i.e., some of the vocal tract resonances 
may not be excited. 
Fig. 8. This figure demonstrates the operations performed by
the resonance synthesizer in generating the word sat. The code 
appearing on the punched tape is shown at the top of the figure, 
the variation of the first three resonant frequencies is indicated in 
the middle diagram, and a simplified representation of the inten- 
sity-frequency-time pattern of the synthesized speech is shown at 
the bottom. 


are excited while their resonant frequencies are in motion 
towards new positions, and, as a result, appropriate formant 
transitions appear at the output of the synthesizer. The next 
code on the teletype is another |æ| code, and this tells the
formants to stay in the same positions, and tells the buzz 
generator to remain on. When the |t| code is read, the 
formants again move to new positions, which they do with 
some delay as before. While the formants are in motion 
the buzz generator is turned off, and a final transition is 
obtained in the vowel at the synthesizer output. 
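The way in which discrete tape instructions become smooth formant transitions can be imitated with a simple first-order lag. The Python sketch below applies such a lag to a staircase of first-formant targets for the four tape symbols of sat. The form of the smoothing circuit, its time constant, and the target frequencies are all assumptions made for illustration; the text states only that smoothing circuits are included and that the tape is read about 6 times per second.

```python
import math

READ_INTERVAL_S = 1.0 / 6.0   # tape is read about 6 times per second
TIME_CONSTANT_S = 0.03        # hypothetical smoothing time constant
STEP_S = 0.001                # simulation step

# Hypothetical first-formant targets (cps) for the four tape symbols of "sat".
F1_TARGETS = [300.0, 660.0, 660.0, 300.0]   # |s|, |ae|, |ae|, |t|


def smooth_track(targets, interval=READ_INTERVAL_S,
                 tau=TIME_CONSTANT_S, dt=STEP_S):
    """First-order lag applied to a staircase of target values."""
    alpha = 1.0 - math.exp(-dt / tau)
    steps_per_read = round(interval / dt)
    value = targets[0]
    track = []
    for target in targets:
        for _ in range(steps_per_read):
            value += alpha * (target - value)   # move part way to the target
            track.append(value)
    return track


if __name__ == "__main__":
    track = smooth_track(F1_TARGETS)
    n = round(READ_INTERVAL_S / STEP_S)
    for label, k in (("end of |s|", n - 1), ("30 msec into vowel", n + 29),
                     ("end of vowel", 3 * n - 1), ("end of |t|", 4 * n - 1)):
        print(f"{label}: {track[k]:.0f} cps")
```

The buzz generator would be switched on while the track is still moving toward the vowel target, which is what produces the transition at the start of the vowel.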


The lower portion of Fig. 8 is a simplified representation 
of the intensity-frequency-time pattern appearing at the 
synthesizer output. When the |s| code is read from the teletype
tape, the modulator at the output of the noise generator
is turned on with some delay, and the noise is fed through a filter
appropriate for the |s| sound (high-pass with a cut-off frequency
of 4000 cps). After the |æ| code is read the noise
generator is turned off and the buzz generator is turned on
at the same time. After the buzz generator is turned off
at the termination of the vowel there is a silent period, 
following which a burst of noise is generated, giving the 
effect of an unvoiced stop consonant. 

Figure 9 shows an actual spectrogram of the word sat as 
synthesized by the machine. The band of noise for the |s|,
the transitions at the beginning and the end of |æ|, and the
burst of noise for the |t| are all clearly observable on the 
spectrogram. 

To illustrate further the operation of the synthesizer we 
have a tape recording of several sentences generated by the 
machine.¹⁷ The first sentence is Where are you? This is


17 In the verbal presentation of this paper at the AES Convention 
on October 12, 1955, a demonstration recording was played at this 
point. 
Fig. 9. Actual intensity-frequency-time pattern of the word sat
as generated by the resonance synthesizer.



rather a simple sentence to synthesize because it consists 
entirely of vowels or vowel-like sounds, and the voicing is 
continuous. The sentence is generated first with no pitch 
inflection and then with various inflection patterns. Remem- 
ber that these inflection patterns are obtained by utilizing 
only three values of the fundamental frequency. The next 
sentence is We are far, far away, generated first with no 
inflection and then with an appropriate inflection pattern. 
This sample illustrates the manner in which the synthesizer 
produces the unvoiced fricative |f|. Finally, we hear the 
sentence We wish we were far away, too, first in a monotone 
and then with some inflection. 

We have already noted that the synthesizer can be adap- 
ted for operation from continuous control signals. Some 
progress has been made toward the development of instrumentation
that will extract such control signals from continuous
speech.¹⁸ That is, it has been possible to build an
electronic device that accepts natural continuous speech at 
its input and yields at its output signals that represent the 
frequencies of the first three formants, the frequency of vocal 
excitation, and the amplitude of noise and buzz excitation. 
Preliminary tests with this analyzing device connected to 
the synthesizer have shown that speech with fair intelligi- 
bility is generated at the output. Such an analysis-synthesis 
system utilizes a very narrow transmission bandwidth for 
the control signals and thus has application in the develop- 
ment of a bandwidth-compression system for speech. 


18 J. L. Flanagan, “A Speech Analyzer for a Formant-Coding Com- 
pression System,” Scientific Report No. 4, USAF Contract No. 
AF 19(604)-626, Acoustics Laboratory, Massachusetts Institute of 
Technology, (May 1955); also, “Automatic Extraction of Formant 
Frequencies from Continuous Speech,” J. Acoust. Soc. Amer., 28, 110- 
118 (January 1956). 
Control of Amplifier Source Resistance*


Charles A. Wilkins†
David Bogen Co., Inc., New York, New York 


Many varieties of variable damping-factor control have appeared in commercial high-fidelity 
amplifiers during the recent past. This paper discusses and categorizes the different forms, intro- 
ducing an analysis of a general nature that is applicable to all present configurations. General 
equations are developed for the design of any type of circuit utilizing current-voltage feedback 
control of amplifier source impedance, whether it be fixed, semi-variable, or continuously variable. 
A new improved type of variable damping control characterized by a broad range of adjustment
and simplicity of circuitry is described.


SINCE THE introduction of the variable damping control
in March 1954, a great amount of interest has been
shown both in the effects of damping on loudspeakers and 
in methods of controlling power-amplifier source resistance 
to obtain variable damping. The effects of damping on the 
loudspeaker have been treated elsewhere.‡ The contention
that some form of variable control is desirable is indicated 
by the fact that the majority of the quality power ampli- 
fiers now on the market incorporate this feature. The vari- 
ous control circuits take the form of several configurations. 
Described in terms of function, they are: 

A. Switch activated with selectable but fixed positions 
such as +2, +20, -2. 

B. Continuously variable, covering a range from the 
normal damping factor of the amplifier down to some lower 
positive value such as +20 to +0.1. 

C. Continuously variable, covering a range from the 
normal amplifier damping factor up to a higher positive 
or even a negative value (beyond infinity the values be- 
come negative) such as +20 to infinity or to —2. 

D. Continuously variable, covering a range from below 
to above the normal amplifier damping factor, such as 
+0.1 to -1.5. 

The expression “normal damping factor” is used here to 
denote the inherent damping factor of the amplifier prior to 
addition of current feedback. All variable damping controls 
in use at present operate on the principle of introducing a 
feedback voltage that is proportional to the current passing 
through the amplifier load; this voltage is mixed in con- 
trolled proportions with the overall negative voltage-pro- 
portional feedback voltage. 


* Received November 12, 1955.
† Assistant Chief Engineer.
‡ See bibliography.


The relationship between source impedance and damping 
factor may be stated as follows: 
$$Z_s = \frac{Z_l}{DF} \tag{1}$$

where $Z_s$ is the amplifier source impedance, $Z_l$ is the load
impedance, and DF is the amplifier damping factor. As is
evident from eq. 1, an amplifier DF range of from +0.1 to
-1.5 (related to a 16-ohm load) is equivalent to a source
resistance range of from +160 ohms to -10.7 ohms.
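The arithmetic of eq. 1 may be checked with a few lines of Python. The function name is illustrative; the 16-ohm load and the two damping factors are those quoted above.

```python
def source_impedance(load_ohms, damping_factor):
    """Eq. 1: source impedance equals load impedance divided by DF."""
    return load_ohms / damping_factor


if __name__ == "__main__":
    for df in (0.1, -1.5):
        print(f"DF = {df:+.1f}: {source_impedance(16.0, df):+.1f} ohms")
```

A negative damping factor thus corresponds to a negative source resistance, as obtained with positive current feedback.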

The example given above pertains to the D-type of con- 
trol circuit which provides for introduction of controlled 
amounts of either negative current feedback or positive 
current feedback. Type A does this also but is not con- 
tinuously variable. Type B utilizes only negative current 
feedback while type C makes use only of positive current 
feedback. This paper treats only the two variations of 
the type D configuration, inasmuch as such an analysis is 
applicable to each of the other three types merely by re- 
striction. 


TWO POSSIBLE SYSTEMS 

Let us examine the mechanism upon which the two sys- 
tems of each of the aforementioned configurations are based. 
This can be clearly envisioned if we look at the method 
used to measure source resistance (or regulation). The 
source resistance of any circuit may be defined in terms 
of the variation in output voltage between loaded and no- 
load conditions. 


$$Z_s = \frac{Z_l (E_o - E_l)}{E_l} \tag{2}$$

where $E_o$ is the output voltage with no load and $E_l$ is the
output voltage under load.
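The regulation measurement described above amounts to the following calculation, sketched in Python. The helper that predicts the loaded voltage treats the amplifier as an ideal generator in series with its source resistance; it is included only so that the example can be checked against itself, and both function names are hypothetical.

```python
def source_impedance_from_voltages(load_ohms, e_open, e_loaded):
    """Source impedance from no-load and loaded output voltages."""
    return load_ohms * (e_open - e_loaded) / e_loaded


def loaded_voltage(e_open, load_ohms, source_ohms):
    """Voltage across the load for a generator with series resistance."""
    return e_open * load_ohms / (load_ohms + source_ohms)


if __name__ == "__main__":
    for z_source in (160.0, 0.8, -10.0):
        e_l = loaded_voltage(10.0, 16.0, z_source)
        z_back = source_impedance_from_voltages(16.0, 10.0, e_l)
        print(f"source {z_source:+.1f} ohms: loaded output {e_l:.2f} volts, "
              f"recovered {z_back:+.1f} ohms")
```

With a negative source resistance the loaded voltage exceeds the no-load voltage, and the measurement returns a negative value.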
It may be seen by inspecting eq. 2 that we may choose
whether we wish to keep either $E_o$ constant or $E_l$ constant
as source impedance, $Z_s$, is varied. In a system designed
to keep $E_o$ constant, the damping control will have no effect
on the output voltage under a condition of no load but will
produce a variation in output voltage when the load is
connected. In a system designed to maintain $E_l$ constant,
the damping control will produce a variation in output
voltage with no load but will have no effect on the output
voltage with load connected.


Eo-CONSTANT SYSTEM 


The $E_o$-constant system can be used to advantage if a
filter is employed in the current feedback circuit to restrict 
the gain variation and damping control to frequencies below 
roughly 300 cps. It is in this region that damping is most 
needed and most effective with the average loudspeaker. 
The gain variation can be utilized advantageously to equal- 
ize for low-frequency attenuation resulting from speaker 
overdamping or for low-frequency accentuation produced 
in an underdamped speaker. This system is shown in Fig. 1. 


Fig. 1. Equivalent circuit of $E_o$-constant system for gain analysis.


CIRCUIT ANALYSIS 


Figure 1 shows the equivalent circuit of an amplifier 
driving load Z,. In the load return, a bridge circuit com- 
posed of Ry, Ryz, and R; has been inserted to sample a 
voltage that is proportional to the current through the load. 
When the slider of potentiometer R; is in position 1, the 
current feedback is negative and adds to the negative voltage 
feedback. In position 2 the current feedback is positive and 
subtracts from the negative voltage feedback. Position 3 
balances the bridge and there is zero current feedback. 


GAIN ANALYSIS 
The gain, $A'$, of a feedback amplifier is defined by

$$A' = \frac{A}{1 - A\lambda} \tag{3}$$

where $A$ is the gain with no feedback and $\lambda$ is the feedback
factor. Let the following assumption be made:


RitR2>>R, 

Ri + R2 >> R; 

R; = Rn + Re 
R2 


Rit Re 
The voltage distribution about the loop can be expressed as

$$AE = I(R_g + Z_L + R_f) \qquad (4)$$

The voltage, $E$, applied to the input of the amplifier is the
sum of the signal, $e_s$, and feedback, $E_{fb}$, voltages.

$$E = e_s + E_{fb} \qquad (5)$$


When the slider of $R_3$ is in position 3, no current feedback
is introduced. The total feedback voltage, $E_{fb3}$, is

$$E_{fb3} = I Z_L \beta \qquad (6)$$

and

$$E_{fb3} = -AE\lambda_3 \qquad (7)$$

where $\lambda_3$ is the feedback factor in position 3. Combining eqs.
4, 6, and 7,

$$\lambda_3 = -\frac{Z_L \beta}{R_g + Z_L + R_f} \qquad (8)$$

When the slider of $R_3$ is in position 1, negative current feedback
is introduced. The total feedback voltage is

$$E_{fb1} = I[Z_L \beta + R_{f1}(1 - \beta)] \qquad (9)$$

and

$$E_{fb1} = -AE\lambda_1 \qquad (10)$$

where $\lambda_1$ is the feedback factor in position 1. Combining
eqs. 4, 9, and 10,

$$\lambda_1 = -\frac{Z_L \beta + R_{f1}(1 - \beta)}{R_g + Z_L + R_f} \qquad (11)$$

When the slider of $R_3$ is in position 2, positive current feedback
is introduced. The total feedback voltage is

$$E_{fb2} = I[(Z_L + R_f)\beta - R_{f2}(1 - \beta)] \qquad (12)$$

and

$$E_{fb2} = -AE\lambda_2 \qquad (13)$$

where $\lambda_2$ is the feedback factor in position 2. Combining
eqs. 4, 12, and 13,

$$\lambda_2 = -\frac{(Z_L + R_f)\beta - R_{f2}(1 - \beta)}{R_g + Z_L + R_f} \qquad (14)$$

By substituting $\lambda_1$, $\lambda_2$, and $\lambda_3$ in general gain eq. 3, the
gain $A'$ of the amplifier may be determined for each case.
It can be seen from inspecting eqs. 1 and 2 that for a DF
range of from +2 to -2, the loaded output voltage variation
will be 3:1, or 9.5 db, as the slider of $R_3$ is moved from
position 1 to position 2.
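
The effect of the slider position on gain can be checked numerically. The following is a minimal sketch, assuming the feedback factor is read as $\lambda$, the amplifier's internal resistance as $R_g$, and the total current-feedback resistance as $R_f = R_{f1} + R_{f2}$. The function names and all component values are hypothetical, chosen only so that the net feedback stays negative in every position.

```python
import math

def feedback_factors(z_load, r_g, r_f1, r_f2, beta):
    """Feedback factors for slider positions 1, 2, and 3 (eqs. 11, 14, 8)."""
    r_f = r_f1 + r_f2
    loop = r_g + z_load + r_f            # series resistance around the loop, eq. 4
    lam1 = -(z_load * beta + r_f1 * (1 - beta)) / loop
    lam2 = -((z_load + r_f) * beta - r_f2 * (1 - beta)) / loop
    lam3 = -(z_load * beta) / loop
    return lam1, lam2, lam3

def closed_loop_gain(a_open, lam):
    """General gain expression, eq. 3: A' = A / (1 - A * lambda)."""
    return a_open / (1 - a_open * lam)

def ratio_to_db(voltage_ratio):
    """Express a voltage ratio in decibels."""
    return 20 * math.log10(voltage_ratio)

# Hypothetical values: 8-ohm load, 2-ohm internal resistance,
# two 0.2-ohm current-feedback resistors, beta = 0.1, open-loop gain 100.
lam1, lam2, lam3 = feedback_factors(8.0, 2.0, 0.2, 0.2, 0.1)
gain1, gain2, gain3 = (closed_loop_gain(100.0, lam) for lam in (lam1, lam2, lam3))
print(f"A' in positions 1, 3, 2: {gain1:.2f}, {gain3:.2f}, {gain2:.2f}")
print(f"A 3:1 voltage ratio is {ratio_to_db(3):.1f} db")
```

With these illustrative values the gain is lowest in position 1 (added negative current feedback) and highest in position 2, which is the direction of variation the text describes.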


SOURCE IMPEDANCE ANALYSIS 


Figure 2 shows the same equivalent circuit as in Fig. 1,
with the exception that the signal input terminals have been
shorted and a current generator has been substituted for
the load. The assumptions are the same as for the gain
analysis. The voltage developed across the current generator
will be

$$E_s = I R_g - AE + I R_f \qquad (15)$$

With the slider of $R_3$ in position 3, no current feedback is
introduced. The voltage $E$ applied to the amplifier input
terminals is

$$E = E_s \beta \qquad (16)$$

Combining eqs. 15 and 16 and solving for $E_s/I$,

$$Z_{s3} = \frac{R_g + R_f}{1 + A\beta} \qquad (17)$$

With the slider of $R_3$ in position 1, negative current feedback
is introduced. The voltage applied to the amplifier input is

$$E = E_s \beta - I R_{f1} \qquad (18)$$

Combining eqs. 15 and 18 and solving for $E_s/I$,

$$Z_{s1} = \frac{R_g + R_f + A R_{f1}}{1 + A\beta} \qquad (19)$$

With the slider of $R_3$ in position 2, positive current feedback
is introduced. The voltage applied to the input terminals is

$$E = (I R_g - AE)\beta + I R_{f2} \qquad (20)$$

Combining eqs. 15 and 20 and solving for $E_s/I$,

$$Z_{s2} = R_g + R_f - \frac{A(R_g \beta + R_{f2})}{1 + A\beta} \qquad (21)$$

Fig. 2. Equivalent circuit of $E_o$-constant system for source-impedance
analysis.
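
The three source-impedance expressions can be evaluated side by side. This is a rough sketch under the same reading of the symbols ($R_g$ for the amplifier's internal resistance, $R_f = R_{f1} + R_{f2}$). The function name and the numbers are hypothetical, and the damping factor is taken here as load impedance divided by source impedance, which is an assumption about the definition used earlier in the paper.

```python
def source_impedances(a_open, beta, r_g, r_f1, r_f2):
    """Source impedance for slider positions 1, 2, and 3 (eqs. 19, 21, 17)."""
    r_f = r_f1 + r_f2
    z_s1 = (r_g + r_f + a_open * r_f1) / (1 + a_open * beta)
    z_s2 = r_g + r_f - a_open * (r_g * beta + r_f2) / (1 + a_open * beta)
    z_s3 = (r_g + r_f) / (1 + a_open * beta)
    return z_s1, z_s2, z_s3

# Hypothetical values: open-loop gain 100, beta = 0.1, 2-ohm internal
# resistance, two 0.2-ohm current-feedback resistors, 8-ohm load.
z_load = 8.0
z_s1, z_s2, z_s3 = source_impedances(100.0, 0.1, 2.0, 0.2, 0.2)
for position, z_s in ((1, z_s1), (3, z_s3), (2, z_s2)):
    # Assumed definition: damping factor = load impedance / source impedance.
    print(f"position {position}: Z_s = {z_s:+.3f} ohm, DF = {z_load / z_s:+.2f}")
```

For these values the source impedance is positive in positions 1 and 3 and negative in position 2, so the control sweeps the damping factor through both signs.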


$E_L$-CONSTANT SYSTEM


This system is capable of a greater range of control than
the $E_o$-constant system which has been described. Another
obvious advantage over the $E_o$-constant system lies in its
ability to permit a constant amount of negative feedback
to be maintained around the amplifier regardless of the
setting of the damping control. Hence, amplifier distortion
is not a function of control adjustment. The circuit
configuration is shown in Fig. 3.


CIRCUIT ANALYSIS 


It can be seen from Fig. 3 that the bridge is arranged so
that the arm of potentiometer $R_3$ can sample at one end of
its travel, position 1, a negative feedback voltage that is
100% proportional to load current. At the other end of
its travel, position 2, the arm samples combined negative
voltage-proportional and positive current-proportional
feedback voltages. If the value of current feedback resistor, $R_{f1}$,
is selected to introduce the amount of negative feedback
required by the amplifier design, and if the difference between
the negative voltage feedback introduced by the voltage
divider consisting of resistors $R_1$ and $R_2$ and the positive
current feedback introduced by resistor $R_{f2}$ is made to have
a net negative phasing and is equal in amplitude to the
voltage developed across resistor $R_{f1}$, the gain, $A'$, of the
loaded amplifier will then remain independent of the position
of the arm of potentiometer $R_3$.


GAIN ANALYSIS 


The assumptions made in the analysis of the $E_o$-constant
system are also applicable to the analyses which follow.
Equations 4 and 5 also apply.

Pure negative current feedback is introduced when the
slider of $R_3$ is in position 1. The total feedback voltage is

$$E_{fb1} = I R_{f1} \qquad (22)$$

and

$$E_{fb1} = -AE\lambda_1 \qquad (23)$$

where $\lambda_1$ is the feedback factor in position 1. Combining
eqs. 4, 22, and 23,

$$\lambda_1 = -\frac{R_{f1}}{R_g + Z_L + R_f} \qquad (24)$$

Combined negative voltage and positive current feedback
is introduced when the slider of $R_3$ is in position 2. The total
feedback voltage is

$$E_{fb2} = I(Z_L + R_f)\beta - I R_{f2} \qquad (25)$$

and

$$E_{fb2} = -AE\lambda_2 \qquad (26)$$

where $\lambda_2$ is the feedback factor in position 2. Combining
eqs. 4, 25, and 26,

$$\lambda_2 = -\frac{(Z_L + R_f)\beta - R_{f2}}{R_g + Z_L + R_f} \qquad (27)$$

The values for $\lambda_1$ and $\lambda_2$ given by eqs. 24 and 27 may be
substituted in general gain eq. 3 for determination of gain,
$A'$, of the amplifier.

To make the gain, $A'$, remain constant as the slider of $R_3$
is moved from position 1 to position 2, make

$$\lambda_1 = \lambda_2 \qquad (28)$$

Combining eqs. 24 and 27, we see that

$$R_{f1} = (Z_L + R_f)\beta - R_{f2} \qquad (29)$$

or

$$\beta = \frac{R_{f1} + R_{f2}}{Z_L + R_f} \qquad (30)$$

This can be stated in final form giving all parameters as

$$R_2 = \frac{R_1 (R_{f1} + R_{f2})}{Z_L} \qquad (31)$$

Inspection of eqs. 2 and 3 will indicate that the change in
output voltage throughout control rotation under no load
will be the same for an equivalent DF range as that for the
loaded $E_o$-constant system, or 9.5 db for a range covering
+2 to -2.

Fig. 3. Equivalent circuit of $E_L$-constant system for gain analysis.
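
The constant-gain condition lends itself to a quick numerical check. The sketch below assumes eq. 31 reads $R_2 = R_1 (R_{f1} + R_{f2}) / Z_L$ with $\beta = R_2 / (R_1 + R_2)$, and confirms that this choice makes the two feedback factors of eqs. 24 and 27 equal. The function names and component values are hypothetical.

```python
def divider_for_constant_gain(r_1, z_load, r_f1, r_f2):
    """Eq. 31: the R2 that equalizes the feedback factors, with the resulting beta."""
    r_2 = r_1 * (r_f1 + r_f2) / z_load
    beta = r_2 / (r_1 + r_2)
    return r_2, beta

def lambda_position_1(z_load, r_g, r_f1, r_f2):
    """Eq. 24: pure negative current feedback."""
    return -r_f1 / (r_g + z_load + r_f1 + r_f2)

def lambda_position_2(z_load, r_g, r_f1, r_f2, beta):
    """Eq. 27: negative voltage feedback combined with positive current feedback."""
    r_f = r_f1 + r_f2
    return -((z_load + r_f) * beta - r_f2) / (r_g + z_load + r_f)

# Hypothetical values: R1 = 10 kilohms, 8-ohm load, 2-ohm internal
# resistance, two 0.2-ohm current-feedback resistors.
r_2, beta = divider_for_constant_gain(10_000.0, 8.0, 0.2, 0.2)
lam1 = lambda_position_1(8.0, 2.0, 0.2, 0.2)
lam2 = lambda_position_2(8.0, 2.0, 0.2, 0.2, beta)
print(f"R2 = {r_2:.1f} ohm, beta = {beta:.5f}")
print(f"lambda_1 = {lam1:.6f}, lambda_2 = {lam2:.6f}")
```

Because the two feedback factors coincide, eq. 3 gives the same loaded gain at both ends of the control's travel.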


SOURCE IMPEDANCE ANALYSIS 


Figure 4 shows the equivalent circuit of Fig. 3 with signal
input terminals shorted and a current generator substituted
for the load. The assumptions are the same as in the gain
analysis. The voltage developed across the current generator
is given by eq. 15:

$$E_s = I R_g - AE + I R_f \qquad (15)$$

The slider of $R_3$ is set at position 1 to introduce pure
negative current feedback. The voltage, $E$, fed back to
the input, is

$$E = -I R_{f1} \qquad (32)$$

Combining eq. 32 with eq. 15 and solving for $E_s/I$,

$$Z_{s1} = R_g + R_f + A R_{f1} \qquad (33)$$

When the slider of $R_3$ is set at position 2, the combined
negative voltage-proportional and positive current-proportional
voltages that are fed back can be expressed as

$$E = (I R_g - AE)\beta + I R_{f2} \qquad (34)$$

Combining eqs. 34 and 15 and finding $E_s/I$,

$$Z_{s2} = R_g + R_f - \frac{A(R_g \beta + R_{f2})}{1 + A\beta} \qquad (35)$$

which is, as may be expected, the same expression obtained
for the source impedance of the $E_o$-constant system with the
slider of $R_3$ in this position.
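
As a rough illustration of the control range, the two end-point expressions can be evaluated directly. This sketch assumes eqs. 33 and 35 read as $Z_{s1} = R_g + R_f + A R_{f1}$ and $Z_{s2} = R_g + R_f - A(R_g\beta + R_{f2})/(1 + A\beta)$. The function name and values are hypothetical, with $\beta$ taken from the constant-gain condition for an 8-ohm load.

```python
def el_source_impedances(a_open, beta, r_g, r_f1, r_f2):
    """Source impedance at the two ends of the control's travel."""
    r_f = r_f1 + r_f2
    z_s1 = r_g + r_f + a_open * r_f1                                       # eq. 33
    z_s2 = r_g + r_f - a_open * (r_g * beta + r_f2) / (1 + a_open * beta)  # eq. 35
    return z_s1, z_s2

# Hypothetical values: open-loop gain 100, 2-ohm internal resistance,
# two 0.2-ohm current-feedback resistors, beta = R_f / (Z_L + R_f) for 8 ohms.
beta = 0.4 / 8.4
z_s1, z_s2 = el_source_impedances(100.0, beta, 2.0, 0.2, 0.2)
print(f"position 1: {z_s1:+.2f} ohm, position 2: {z_s2:+.2f} ohm")
```

Position 1 has no $1 + A\beta$ divisor, so the positive end of the range scales directly with the open-loop gain.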


CONCLUSIONS 


The foregoing presentation, in the author’s view, indicates
that the $E_L$-constant system is superior to the $E_o$-constant
system on the following points:

1. System distortion remains constant and is independent
of damping control adjustment.

2. With load connected the output-voltage level does not
vary with damping control setting.

3. The range of source-resistance control is greater for
the same values of the parameters $R_{f1}$ and $R_{f2}$. This is
an important consideration inasmuch as a finite fraction
of the amplifier power output is dissipated in the
current feedback resistors.

Fig. 4. Equivalent circuit of $E_L$-constant system for source-impedance
analysis.

The only possible objection to the $E_L$-constant system as
compared with the $E_o$-constant system is that the former
provides no means to equalize for speaker response. However,
such equalization must of necessity be a compromise
if a single fixed network is employed. It has been found
from experience that, in general, any such equalization that
is required may be readily introduced by use of the bass
tone control. More precise equalization, such as is obtainable
with an insertion device having a variable inflection
frequency, may be desirable in some cases; however, such a
need is dependent on the characteristics of the loudspeaker
system.

The Use of Spectrograms for Speech Analysis and Synthesis*


John M. Borst
Haskins Laboratories, New York, N.Y.


The use of spectrograms and a pattern playback for research on speech has the unique advantage 
that it permits the study of isolated acoustic cues for speech perception. This method has shown 
that the consecutive sounds of the language are usually so intimately connected that they cannot be 
separated and recombined in a different order without serious loss in intelligibility. Formant tran- 
sitions, which often characterize the consonants, are found to fall into well-defined patterns and 
groups. The knowledge gained by studies with synthetic speech has guided the construction of a 
speech synthesizer which copies the sounds generated by the human speech organs and which can 
be operated from the information found in spectrograms. 


INTRODUCTION 


A STUDY of the perception of speech poses a number of
questions: How does one speech sound differ from
another insofar as the ear is concerned? What serves to 
specify acoustically any given speech sound? What are the 
minimum requirements for speech to be intelligible? What 
are the minimum units or “building blocks” of speech, or are 
there any? 

It is most helpful, in such an investigation, to have the 
speech in a permanent form so that one can examine its parts 
—take one part away or alter another and observe the effect. 
This is possible if the speech can be transformed into visual 
forms. So far, there are two different visual transforma- 
tions available, the oscillogram and the spectrogram. Of 
these two, the spectrogram has proved to be the more useful. 
It is, as may be seen in Fig. 1, very difficult to recognize and 
classify the fine distinctions between similar sounds in an 
oscillogram, whereas in a spectrogram one can recognize dis- 
tinctive patterns. The pattern for a given word is the same 
regardless of who spoke it, and its shape is as distinctive as 
the sound to which it corresponds. 

One would, of course, like to know the significance of in- 
dividual features—for instance, of the broad dark bands 
on the spectrogram: What would happen if the pattern con- 
tained only these bands and nothing else? Or what would 
happen if one of the bands were omitted or its curves 
altered? A direct answer is possible if one can modify or 
alter these patterns—or even make entirely artificial ones— 
and then turn them back into sound again to be evaluated 
by ear. For such studies, two translating devices are 


* Presented at the Seventh Annual Convention of the Audio Engi- 
neering Society, New York, October 12-15, 1955. 


needed: one to make a picture from the speech—the spec- 
trograph—and the other to turn a spectrogram into sound— 
the pattern playback. 

The spectrograph is already well known. The machine
which we have used was made at our own laboratory. It
differs from the Bell Laboratories spectrograph in construction,
though not in principle of operation. Our instrument
records the spectrogram on film rather than on paper, and
thus permits a greater dynamic range (36 db) without
compression. Also, we use a longer sample of speech (12 seconds).


THE PATTERN PLAYBACK 


The other translating machine, the playback, is illustrated
in Fig. 2. A light source provides an intense thin line of
light which is focused radially on a rotating disc. This
“tone wheel” is a film negative which carries 50 concentric
variable-density sound tracks. The inner one has 4 sine
waves on it, the next one 2 × 4 sine waves, the next one
3 × 4 sine waves, etc. Thus, by rotating the disc at 1800
rpm (30 rps), we modulate the first 1/10 in. of the line
at 120 cps, the next 1/10 in. at 240 cps, and the fiftieth
segment at 6000 cps. The line becomes, in a sense, the optical
equivalent of a piano keyboard, though each of its keys
plays only a single sine wave.
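
The arithmetic behind the tone wheel can be written out directly. The following sketch uses only the figures given in the text (50 tracks, 4 cycles on the innermost track, 1800 rpm, 1/10 in. per track); the function names are illustrative.

```python
def tone_wheel_frequencies(tracks=50, cycles_on_inner_track=4, rpm=1800):
    """Modulation frequency in cps of each track, innermost first."""
    revs_per_second = rpm / 60
    return [k * cycles_on_inner_track * revs_per_second
            for k in range(1, tracks + 1)]

def frequency_at(position_in, track_width_in=0.1):
    """Frequency of the track covering a point `position_in` inches along the line."""
    freqs = tone_wheel_frequencies()
    index = min(int(position_in / track_width_in), len(freqs) - 1)
    return freqs[index]

freqs = tone_wheel_frequencies()
print(freqs[0], freqs[1], freqs[-1])   # first, second, and fiftieth track
```

Each track is a harmonic of the 120-cps fundamental, so the playback behaves like a bank of 50 fixed oscillators whose levels are set by the pattern.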

The modulated line of light is focused on the moving 
spectrogram. If this is an original spectrogram, the trans- 
mission system is used, and the phototube is placed below it, 
so that whatever light passes through will be collected. For 
an artificial or hand-painted spectrogram, the reflection sys- 
tem is used, and the light collector is then placed above the 
spectrogram. Wherever there is any paint on the trans- 
parent tape, light will be diffusely reflected into the collector. 
This light is modulated at a frequency determined by its
position along the transverse, or frequency, axis of the
spectrogram.

Fig. 1. Oscillogram and spectrogram of the spoken words
“Audio Engineering Society.” Note that the spectrogram gives
a clearer “picture” of the speech.

The machine is limited to a fixed pitch and one might ex- 
pect that it could not make “hissy” sounds. However, it was 
found that noise can be generated by painting patches of 
fine dots, and reasonably satisfactory s-like sounds can be 
made. 


A spectrogram of real speech will, when run through the 
machine, produce synthetic speech which is highly intelli- 
gible. So also will a hand-painted spectrogram (Fig. 3) de- 
rived from an actual one by copying only the main fea- 
tures. Some trial and error was necessary to find which 
features were important and how much the spectrographic 
patterns could be streamlined without impairing the intelli- 
gibility. Some of the dark bands in the upper part of the 
real spectrogram are a peculiarity of the particular speaker 
and are present whenever there is any voicing, but are not 
necessary for intelligibility. Not every speaker has these 
resonances, and some have them at other frequencies. 


Fig. 2. Simplified diagram of the pattern playback. (Repro- 
duced by courtesy of the American Journal of Psychology.) 


With a spectrogram which has been reduced to its barest 
essentials, autosuggestion may play a part in determining 
what the listener will hear. If the experimenter knows what 
the text is supposed to be, he will very often hear it no mat- 
ter how bad the sound may be. This effect was demon- 
strated with the spectrogram shown in Fig. 4. The lower 
line has been reduced to but a single “formant” and yet, 
when played through the machine, the sentence is still in- 
telligible—to those who know the text! 


FORMANTS 


One may observe in Fig. 3, for example, that a hand- 
painted spectrogram consists mainly of broad lines running 
sinuously from left to right. These broad lines are called 
formants. They represent a concentration of energy within 
a narrow frequency band—i.e., a buzz sound passing through 
an acoustic filter. Or one may consider that a resonant cav- 
ity in the vocal tract is being shock-excited by a puff of air 
every time the vocal cords open and close. Each time this
happens, a damped wave results. We can imitate this
behavior electrically and generate a single formant by applying
a short pulse to a resonating circuit, and by then repeating
the pulse a hundred times a second or so. This produces
a formant like those we see in spectrograms.

Fig. 3. A spectrogram and the corresponding hand-painted,
simplified version.

Fig. 4. Two versions of a painted spectrogram differing in the
degree of simplification. The text of the first version (A) is readily
understood; the second (B) is intelligible to those who know the
text, but not to others.

The top line of Fig. 5 shows an oscillogram, a spectrogram,
and a conventional frequency analysis (or section)
of such a formant. The repetition rate of the damped wave 
trains represents the pitch. On the spectrogram, and in the 
section, the distance between the harmonics corresponds to 
the pitch. The frequency of the damped oscillation is the 
formant frequency. On the spectrogram it is the place (on 
the frequency scale) of greatest density, and on the section 
it is the peak of the envelope. This envelope, of course, is 
the resonance curve of the tuned circuit. It may be seen
from the figure that there need not be a harmonic precisely 
at the peak; this would happen only if the formant fre- 
quency were an integral multiple of the pitch. 

On the second line of Fig. 5 is another formant with the 
same formant frequency but with a higher pitch. The har- 
monics have moved farther apart but the resonance curve is 
still the same. On the bottom line is a third formant with a 
higher formant frequency but at the same pitch as in the 
top line. The spectrogram now has the narrower spacing 
between harmonics, but its center of density has moved to 
a higher frequency, and the peak of the resonance lies far- 
ther out on the frequency scale. 
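
The independence of pitch and formant frequency can be reproduced with a small simulation. This is a minimal sketch, not the circuit of the paper: it swaps in a two-pole digital resonator for the tuned circuit, excites it with one unit pulse per pitch period, and measures the level of each harmonic with a direct Fourier sum. The sample rate, bandwidth, and function names are assumptions chosen for illustration.

```python
import math

def formant_wave(pitch_hz, formant_hz, bandwidth_hz, fs=8000, seconds=0.5):
    """Excite a two-pole resonator with one unit pulse per pitch period."""
    r = math.exp(-math.pi * bandwidth_hz / fs)       # pole radius sets the damping
    a1 = 2.0 * r * math.cos(2.0 * math.pi * formant_hz / fs)
    a2 = -r * r
    period = round(fs / pitch_hz)                    # samples between pulses
    y1 = y2 = 0.0
    out = []
    for n in range(int(fs * seconds)):
        x = 1.0 if n % period == 0 else 0.0
        y = x + a1 * y1 + a2 * y2                    # each pulse starts a damped wave
        out.append(y)
        y2, y1 = y1, y
    return out

def harmonic_levels(wave, pitch_hz, fs=8000, top_hz=3000):
    """Magnitude of each harmonic of the pitch, from a direct Fourier sum."""
    levels = {}
    k = 1
    while k * pitch_hz <= top_hz:
        f = k * pitch_hz
        re = sum(v * math.cos(2 * math.pi * f * n / fs) for n, v in enumerate(wave))
        im = sum(v * math.sin(2 * math.pi * f * n / fs) for n, v in enumerate(wave))
        levels[f] = math.hypot(re, im)
        k += 1
    return levels

# Same 500-cps formant at two pitches, 100 and 200 cps.
low = harmonic_levels(formant_wave(100, 500, 50), 100)
high = harmonic_levels(formant_wave(200, 500, 50), 200)
strongest_low = max(low, key=low.get)
strongest_high = max(high, key=high.get)
print("pitch 100 cps: strongest harmonic at", strongest_low, "cps")
print("pitch 200 cps: strongest harmonic at", strongest_high, "cps")
```

At a pitch of 100 cps a harmonic falls exactly on the 500-cps resonance. At 200 cps the harmonics are spaced twice as far apart and none coincides with the peak, so the strongest one is a neighbor of the formant frequency.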

There is still a third independent variable, the rate of
damping or the Q of the circuit, but it is of less importance.
The Q of vocal cavities is rather low, somewhere between
2 and 15. It increases with frequency, though not
proportionally, and it varies with the individual.

Fig. 5. Three aspects of a formant. The effects on the waveform, spectrogram and section
of various combinations of pitch and formant frequency are shown for (1) low pitch and low
formant frequency, (2) high pitch and a low formant frequency, and (3) low pitch and a high
formant frequency.

A diagram of the circuit used to generate the formants
described in the preceding paragraphs is shown in Fig. 6.
It consists essentially of a tuned circuit which can be
adjusted electrically, i.e., by changing a voltage.¹ Since an
inductance would become quite large at these low frequencies,
an RC circuit is used instead. The circuit is a phase-shift
oscillator with the gain set too low for oscillation. The
resistances in the phase-shifting network are then replaced by
tubes, so that the circuit can be tuned by adjusting the grid
bias on the different tubes. The oscillator is excited by
toothpick pulses (differentiated sawtooth waves), and the
amplitude is controlled by a biased diode which passes more
or less of the exciting pulses. By adding the signals from
two or three such generators, one can make a vowel sound.

Fig. 6. Schematic of a formant generator consisting of a four-mesh
phase-shift oscillator with insufficient gain to oscillate. Tubes
replace the resistances in the phase-shifting mesh.

It has long been known that vowels are identified by the
frequencies of the two or three lowest formants. These
formants occur at virtually the same frequencies regardless
of who speaks the vowels, although there are systematic
variations among men, women, and children, just as there are
very evident variations in pitch. Thus, the formants
for women’s voices are about 10% higher than for men,
although the pitch may be twice as high. (For a recent study
of the formant frequencies of spoken vowels see an article
by Peterson and Barney.²)

A striking characteristic of the spectrographic patterns of
Figs. 1, 3, and 4 is that there is almost no place where the
formant frequencies remain the same very long. Either
they glide, as slowly but constantly changing vowels, or they
move abruptly, as transitions which are no longer perceived
as vowels but rather as consonants. Indeed, much of the
information in speech resides in these transitions, and the
demonstration of their importance was one of the significant
contributions of the synthetic method. Some consonants
have, in addition to transitions, a burst of noise (examples
p, t, k), or a noise portion of rather longer duration
(examples s, sh), or a weak steady-state formant (examples
l, r, m, n). The transition which is present in addition
to the noise may or may not be important to the listener.
In the spectrogram of ga (Fig. 7), the noise burst
and the transitions at the beginning of the formants both
play a part in the recognition of the consonant, but the part
played by the transitions is by far the more significant. An
important virtue of the hand-painted spectrograms is that
their use permits the isolation of the effects of burst and
transition simply by painting only one at a time.

Fig. 7. A spectrogram of the spoken syllable ga. The listener’s
identification of the consonant is due to the burst of noise as well
as to the transitions of the formants. (Reproduced by courtesy of
Psychological Monographs.)

¹ Millard E. Ames, “A Wide-Range Deviable Oscillator,” Electronics,
22, No. 5, 96-100 (May 1949).

² Gordon E. Peterson and Harold L. Barney, “Control Methods
Used in the Study of the Vowel,” J. Acoustical Soc. Amer., 24, No. 2,
175-184 (March 1952).

Careful studies have been made of the effect of varying the
frequency location of the burst, or the direction, extent, or
duration of the various transitions. A discussion of these
experiments is beyond the scope of this paper, but can be
found in recent articles in the Journal of the Acoustical
Society of America.³,⁴

The results of some of these experiments are given in Fig.
8, which shows the best transitions for b, d, g, with seven
vowels. Note that the transitions are different for each
consonant-vowel combination. It follows that if one were
splicing a magnetic tape and were to cut off the b part of bi and

Fig. 8. Simplified spectrographic patterns for the consonants b,
d, and g paired with each of several vowels. The transitions for a
given consonant vary with the vowel which follows it. (Reproduced
by courtesy of Journal of the Acoustical Society of America.)


try to transplant it onto an a, it would not fit, and we would 
not get the sound ba, but rather something quite unrecog- 
nizable. Similar difficulties would be encountered with most 
other consonant-vowel combinations: When one cuts the
consonant away and adds it to another vowel, it is either
unacceptable or it may even become another consonant. For
instance, cutting the initial noise burst of p from pi and
adding it to a vowel a could result in the sound ka rather than
pa.⁵,⁶

³ F. S. Cooper, P. Delattre, A. M. Liberman, J. M. Borst, and L. J.
Gerstman, “Some Experiments on the Perception of Speech Sounds,”
J. Acoust. Soc. Amer., 24, No. 6, 597-606 (November 1952).

⁴ A. M. Liberman, P. Delattre, F. S. Cooper, and L. J. Gerstman,
“The Role of Consonant-Vowel Transitions in the Perception of the
Stop and Nasal Consonants,” Psychological Monographs, 68, No. 379,
1-13 (1954).

From all this it appears that one cannot separate individ- 
ual speech sounds and reunite them in a different order. 
This explains, for example, why a speech synthesizer depend- 
ing on pre-recorded single sounds—to be combined a la 
typewriter—is likely to give unsatisfactory speech. Also, 
one would expect that a phoneme recognizer (speech type- 
writer) would work better if it utilized the shapes of transi- 
tions as well as spectral distributions of energy. 


LOCUS 


One of the rules which is most helpful in painting or
synthesizing speech is the concept of the locus; this gives a
simple way to account for the many different transitions
shown in Fig. 8. Thus for a d preceding any vowel, we can
see from Fig. 9 that the second formant transition appears
to have started from a virtual point, or locus, at about 1800
cps; similarly, there is a locus for b and another for g.
Moreover, the locus for d is valid for other dental sounds,
such as t and n, and is approximately correct for s and z.
We can, indeed, build up a rather complete “periodic” table
of speech sounds and their acoustic cues.⁷ With this
information, it is now possible to paint spectrograms by rule,
that is, without reference to an actual spectrogram of the
utterance. It is even possible to paint in dialects, as the
recording of Alabama with a Southern or a French accent
demonstrated.† The pattern playback can also produce
non-speech sounds when one paints arbitrary figures instead of a
spectrogram; likewise, by drawing short horizontal lines,
comparable to slits in a piano-roll, one can make a kind of
“music.”

Fig. 9. Patterns which illustrate the locus principle. The second
formant transitions of all vowels appear to originate in a virtual
point (i.e., at one frequency for d, as is shown in the figure, at
another for b, and at a third for g). (Reproduced by courtesy of the
Journal of the Acoustical Society of America.)

⁵ A. M. Liberman, P. Delattre, and F. S. Cooper, “The Role of
Selected Stimulus Variables in the Perception of the Unvoiced Stop
Consonants,” Amer. J. Psychol., 65, 497-516 (October 1952).

⁶ Carol D. Schatz, “The Role of Context in the Perception of Stops,”
Language, 30, 47-56 (1954).


APPLICATION TO RECORDING AND TAPE EDITING 


What are some of the practical uses of this knowledge? 
For one thing, it should help us to understand some of the 
odd things which happen in tape editing. Those who have 
had extensive experience report occasional changes in mean- 
ing when some portion of a word is removed or replaced. 
In practically all cases this might have been predicted. We 
have already seen that transitions cannot usually be trans- 
planted from one vowel to another. For similar reasons, if 
a portion of a transition is cut off, the remainder may re- 
semble the transition of another consonant, and it will 
sound like that consonant; or if a portion of a sibilant or 
fricative is cut off, one may also change the identity of the 
consonant. For instance, in one experiment, the word sha 
was recorded several times, and increasing portions of the 
initial noise patch were sacrificed by snipping away pieces of 
tape. When the progressively reduced versions were played, 
one noticed that the sound changed from sha to tcha, to tia, 
and to something between ta and ka. 

But although one cannot transplant a single speech unit 
from one place to another, it is often possible to do so with 
a syllable and thereby to change completely the meaning 
of a sentence. As one example, the word “reason” was 
changed to “treason” by rerecording “tree” and then using 
it to replace the first syllable of “reason.” In this way, the 
sentence “He’s got me up a tree, there must be some reason 
in this” was changed to “ . . . there must be some treason in 
this.” Correct spacing and a good splice are required. Fig. 
10 shows spectrograms of “tree,” “reason,” and both the 
faked “treason” and the spoken word “treason.” The splice 


⁷ P. C. Delattre, A. M. Liberman, and F. S. Cooper, “Acoustic Loci
and Transitional Cues for Consonants,” J. Acoustical Soc. Amer., 27,
No. 4, 769-773 (July 1955).

† A demonstration was included in Mr. Borst’s presentation at the
Sound Creation session of Audio Engineering Society’s Seventh Annual
Convention.
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is almost undetectable, even on the original spectrogram. 
The spectrograms of Fig. 10 contain portions of the sounds 
preceding and following the words. This was done in order 
to show any changes which might have occurred between 
syllables. 

The realistic recording of the fricative sounds s, sh, f, and
th has long posed special problems, for reasons which are
more or less evident from the frequency and intensity
characteristics of these sounds. In general, they consist of a
patch of noise together with a transition to the following
vowel. The noise portion of s is practically all above 3600
cps, while that of sh comes down lower but not below about
2000 cps. The noise segments of these sounds are rather
long and intense, and they play a major part in their
recognition; the transitions, although often quite prominent in
the spectrogram, seem not to be very important for
recognition. The f and th sounds have noise that is weak, at
frequencies that are not clearly specified; the recognition of
these sounds depends less than for s and sh on the noise portions
and more on the transitions, especially in distinguishing
the two from one another.

Unfortunately there is not much new to be said on the sub- 
ject of how to record and transmit these sounds. To be 
sure, one must not introduce noise at the wrong frequency 
due to intermodulation distortion; also relative intensities 
are important, so one should not boost one frequency region 
too much. Finally, since some of the noise components are 
weak, one should maintain a good signal-to-noise ratio. All 
of this is, of course, well known. 


OCTOPUS 


Another speech synthesizer with which we have worked 
was designed on the basis of information gained in the 
course of our research. This device does not translate spec- 
trograms directly into sound, but rather the operator sets 
the controls with the aid of spectrographic patterns. Octo- 
pus (a name chosen for reasons which may become evident 
from the description) consists of three formant generators of 
the kind shown in Fig. 6, connected in parallel. Each gener- 
ator can be excited by a pulse train or by white noise, or by 
both simultaneously. The exciting signals, supplied by the 
circuit of Fig. 11, are amplitude-modulated before they are 
applied to the formant generators, thus setting the relative 
loudness of the formants. The buzz generator can also be 
varied in frequency by a control voltage. 

Since all of the variables are voltage controlled, the instrument
is potentially able to produce speech if one supplies all
the required voltages at the proper time. One might obtain
these control voltages from a painted tape, from a recording,
or from a suitable analyzer which derives them directly from
speech somewhere far away. However, this particular instrument
was built to save labor in painting the many involved
patterns which are used in our experiments and which
often vary in but one detail, and consequently it obtains its
control voltages from a set of potentiometers.

Fig. 10. Spectrograms showing how a composite “treason”
was faked by tape editing (panels: “treason,” composite;
“treason,” as spoken). The sound was quite convincing
and even the spectrogram does not clearly reveal the tampering.

The control voltages can be “programmed” or set up in
advance for eight consecutive events, as shown in Fig. 12.
A multiple-pole stepping switch then applies these preset
voltages in the proper order. As an example, and taking
but one variable, say the first formant frequency, there is
a voltage divider for each event by which one can set the
frequency of the formant generator. In addition, when the
switch moves, the control voltage changes rapidly or slowly,
depending on the time constant of an adjustable RC circuit.
This permits one to make the transitions needed for many
of the consonants. Also, successive events can have different
durations, which are also set by potentiometers. The
other variables can be controlled in like manner. Fig. 13
shows the control board with its many knobs. There is one
knob for each variable, and they are all repeated eight times
for the eight events.
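
The behavior of one control channel can be sketched in a few lines. This is a hedged illustration only: the function name, the time step, and the example voltages, durations, and time constants are assumptions, and the RC circuit is modeled as an ideal first-order lag.

```python
import math


def control_voltage(presets, durations, time_constants, dt=0.001, v0=0.0):
    """One control channel: a stepping switch selects a preset voltage for
    each event, and an RC lag sets how fast the voltage approaches it."""
    v = v0
    trace = []
    for target, duration, tau in zip(presets, durations, time_constants):
        steps = int(round(duration / dt))
        alpha = 1.0 - math.exp(-dt / tau)  # exact first-order step for dt
        for _ in range(steps):
            v += (target - v) * alpha
            trace.append(v)
    return trace


# Three of the eight events, with assumed settings: a fast transition,
# a slow one, then a fast return.
example = control_voltage(
    presets=[2.0, 5.0, 1.0],           # volts (hypothetical)
    durations=[0.05, 0.12, 0.08],      # seconds per event (hypothetical)
    time_constants=[0.005, 0.03, 0.005],
)
```

Each event carries its own target, duration, and time constant, which corresponds to one column of knobs on the control board for a single variable.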
Fig. 11. Buzz generator and modulator which drive one formant generator of the speech
synthesizer, Octopus.

Fig. 12. Control voltages and stepping-switch connections of Octopus.

Fig. 13. Control board of Octopus, with functions indicated.

Some of the sounds, and the patterns corresponding to
them, were demonstrated at the Speech Creation session by
means of the following recorded commentary by André
Malécot:

“We shall demonstrate some of the things that Octopus
can be made to do by having it produce a pattern which
will be changed one phone at a time in anagram fashion.

“The first pattern will produce the word box and each 
successive set of adjustments will result in a slightly 
different word. The accompanying chart (Fig. 14) 
shows the entire demonstration graphically. 

“For the first word, which is box, Octopus has been set 
up to play the pattern as it is shown at the top of the 
chart. 

“To change box to socks, we change the transitions of
F1, F2, and F3 at the beginning of event 4, eliminate
the voice bar of event 3 and substitute hiss at 3600 cps.


“To change socks to sacks, we change the steady state 
frequencies of event 4. 

“To change sacks to sack, we need only eliminate the hiss 
of event 8. 

“To change sack to sash, we eliminate the burst of event 
7, change the transitions of event 5, and add hiss at 
2400 cps in event 6. 

“To change sash to gash, we eliminate the friction of 
event 3 and change the transitions at the beginning of 
event 4 in formants 1, 2, and 3. 

“To change gash to gas, we eliminate the terminal transi- 
tions, those of event 5, and move the hiss from 2400 
cps up to 3600 cps. 

“To change gas to guess, we change the steady-state fre- 
quencies of event 4. 
Fig. 14. Spectrographic patterns and the corresponding words,
which were spoken by Octopus in a recorded demonstration.

“To change guess to yes we change the rates of transition
at the beginning of event 4.

“The following sounds, based on the same pattern, illus- 
trate intonation variations of the word yes. The final 
yes is played with hiss substituted for buzz, as it would 
be when the word is whispered.” 

Finally, it was shown that the synthesizer could produce
a singing voice. In fact, this is somewhat easier than producing
the speaking voice, since the pitch variations required
in singing are given by the musical score, whereas the intonation
patterns used in speaking constitute a topic on
which much additional research is needed.


This work was supported in part by the Carnegie Corpora- 
tion of New York and in part by contract work for the U.S. 
Department of Defense. 
A Program to Design a High-Quality Pickup*


Walter O. Stanton†
Pickering & Company, Inc., Oceanside, New York


THE ENGINEERING PROGRAM on which the design
of our Fluxvalve pickup was based required about three
years and was developed through four major phases:

1. A thorough study of the present and probable future 
performance requirements for a pickup to do a “best” job 
of playing disc recordings. 

2. A complete theoretical analysis of pickup action, to 
guide basic design effort. 

3. Design of the pickup, according to the analysis, to meet 
the performance specifications established. 

4. Evaluation of the success of the design through very 
extensive use of the pickup in the field, under severe per- 
formance conditions. 

The four phases of the development program will be dis- 
cussed in the order indicated. 


1. PERFORMANCE REQUIREMENTS FOR A 
HIGH-QUALITY PICKUP 

Our study to establish the specifications for a “best” pick- 
up was naturally guided by our many years of experience 
in building pickups for a wide range of applications. Our 
production of pickups for recording studios, broadcast sta- 
tions, and home high-fidelity systems is widely known. Not 
so well known, probably, is our large-scale production of 
pickups for coin-operated phonographs and industrial “back- 
ground-music” systems. 

The requirements for pickups intended for these different 
applications differ in some important respects, while over- 
lapping in other major respects. By examining each appli- 
cation in turn, we can assemble a list of performance require- 
ments which covers all major uses of a high-quality phono- 
graph pickup. 


Recording Studios. In a recording studio, stability of
response is important in a pickup, since it performs essentially
a monitoring function. If the frequency characteristic
of the pickup changes, for instance, a false report will be
given the operator on the performance of his cutting equipment.
Distortion of all kinds must be very low. Frequency
distortion in particular is troublesome, since it obscures the
correlation between lacquer and master.


* Presented before the New York Section of the Audio Engineering
Society, September 13, 1955.
† President.


Radio Stations. A broadcast station must have a com- 
pletely reliable pickup. If a pickup breaks down while “on 
the air” it is costly in audience goodwill and disrupts ex- 
pensive scheduling. Low record wear is another prime re- 
quirement of the broadcast station. The investment in rec- 
ords can be justified only if a record is usable many times 
without significant loss of quality. Again, low distortion of
all kinds is wanted.


Home High-Fidelity Systems. In the home system, low 
record wear is obviously a top requirement. Distortion of 
all kinds must be very low. In addition, ease of installation 
and of stylus replacement are very attractive to the home 
user. 


Coin-Operated Phonographs. The user of coin-operated 
phonographs today wants distortion at the same low level as 
that in the best broadcast and home systems. In addition, 
long stylus life is very important to him, since stylus replace- 
ment is a major element of operating cost. The pickup must 
be reliable, since unscheduled service calls are a second major 
cost element. Very low record wear is not of top importance 
since the records are normally replaced at relatively short 
intervals. 


Background-Music Systems. Reliability, low stylus and 
record wear, and low distortion are all important in back- 
ground music systems. 


Summary of Requirements. From the foregoing, we can 
list five major specifications for a pickup that is to perform 
at top quality in all of the applications referred to: 


1. Very low distortion.
2. Low record wear.
3. Low stylus wear.
4. Reliability.
5. Stability of characteristics.

These five specifications are inter-related in such ways
that a change in any one will affect one or more of the
others, as will be shown in this paper.

We can define each of the five specifications in more detail
as follows:


a. Low distortion. Three kinds of distortion are impor- 
tant: frequency distortion, waveform distortion, and phase 
distortion. Frequency distortion has usually been consid- 
ered any deviation from a flat response, exceeding 1 or 2 db, 
over the audible spectrum from about 30 to about 15,000 
cps. I will give reasons further on to support the notion 
that a flat mechanical frequency response substantially wider 
than the audible spectrum is desirable in a pickup with the 
design objectives outlined in this paper. 

Low waveform distortion is largely a function of proper 
design of the generator device and mechanical elements so 
that the electrical output is closely proportional to stylus 
velocity, in a magnetic or dynamic pickup, or to stylus am- 
plitude, in a piezo-electric or capacity pickup. A figure for 
acceptable waveform distortion has not been agreed on 
within the industry, and measurement techniques are so far 
from standardization that it is doubtful that comparative 
figures have much significance at the present time. How- 
ever, full skill must be applied to the design of the generator 
system so that it is virtually a linear device, assuring wave- 
form distortion of a few percent, at the most. Moreover, 
waveform distortion is closely related to frequency distor- 
tion, as will be discussed in more detail presently. 

Phase distortion is important in pickup action, but its 
analysis and control are very complex and would take us 
beyond the scope of this paper. But phase distortion is at 
a minimum when frequency and waveform distortion are 
minimized, so that the same design procedures apply in gen- 
eral to the problem of minimizing all three kinds of dis- 
tortion. 


b. Low Record Wear. Record wear takes a number of 
forms. One of the most important is the familiar breakdown 
of the high-frequency modulation in the groove. This is a 
function both of stylus size and, more especially, of the 
lateral mechanical impedance of the stylus tip. The tip is 
likely to “plow through” short-wavelength modulation, if the 
frequency of modulation is at or above the armature reso- 
nance. The very high lateral mechanical impedance at reso- 
nance and above resonance raises the force needed to move 
the tip high enough to overcome the elastic strength of the 
groove wall. Another important form of record wear is 
erosion of the groove wall by friction between stylus and 
groove. 


c. Low Stylus Wear. Stylus wear, as is generally known,
takes the form of “flats” which develop on the stylus tip
at the points which bear on the groove wall. The rate at
which such flats develop is a function of the force between
stylus and groove, and this force is principally a function of
the lateral mechanical impedance of the stylus. Thus stylus
wear and record wear are directly related; both are reduced
when the lateral mechanical impedance of the stylus is reduced.
The static vertical stylus force, or “tracking force,”
also affects stylus wear and record wear, but to a much lesser
extent than the mechanical impedance.
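
A rough feel for the forces involved can be had from the inertial term alone. The sketch below is an assumption-laden estimate, not the analysis used in the design: it treats the moving system as a pure mass (ignoring stiffness and damping), and the effective tip mass of 1 milligram is a hypothetical figure chosen only for illustration. For sinusoidal motion with peak velocity v at frequency f, the peak acceleration is 2πfv, so the peak force on a mass m is m·2πf·v.

```python
import math

GRAM_FORCE_N = 9.80665e-3  # newtons per gram-force


def peak_inertial_force(effective_mass_kg, peak_velocity_m_s, freq_hz):
    """Peak force to drive a pure mass sinusoidally: F = m * (2*pi*f) * v."""
    return effective_mass_kg * 2.0 * math.pi * freq_hz * peak_velocity_m_s


# Hypothetical 1 mg effective tip mass, 10 cm/s peak velocity, 10 kc.
force_n = peak_inertial_force(1.0e-6, 0.10, 10_000.0)
force_gf = force_n / GRAM_FORCE_N  # same force expressed in grams
```

Under these assumptions the dynamic force is a sizable fraction of a gram, and it scales directly with moving mass and with frequency, which is consistent with the emphasis on lateral impedance rather than static tracking force.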


d. Reliability. For reliability, a pickup must have stable 
impedance values, and be immune to the effects of moisture, 
temperature, rough handling, airborne impurities, and ex- 
traneous material entering the moving system by magnetic 
attraction or otherwise. The pickup must have the mechani- 
cal strength necessary to hold up indefinitely under the con- 
ditions of its intended use. 


e. Stability of Response. The variation of pickup per- 
formance over a period of time is caused by stylus wear, 
changes in mechanical constants due to fatigue of materials, 
changes in damping constants, etc. Such changes affect the 
performance in the octave from 10 to 20 kc most strongly. 


2. ANALYSIS 


Mathematical Foundation. As a transducer the phono- 
graph pickup belongs to the class which converts mechanical 
energy into electrical energy. It resembles most generators 
in principle only, however, since the mechanical input, in 
modern recording practice, varies in amplitude and in fre- 
quency over a range of approximately 10 octaves from 20 
cps to some 20,000 cps. 

Perhaps the easiest way to begin the analysis is to recog- 
nize that a pickup is made up of two systems, one mechani- 
cal and one electrical, each acting separately upon a me- 
chanical input function. In general, the mechanical system 
is independent of how the electrical system acts. The elec- 
trical system, though, depends upon the behavior of the 
mechanical. 

Diagrammatically, we can represent the total system by
means of the block diagram of Fig. 1.


[Block diagram: record signal source, followed in cascade by f_g, f_m, and f_e.]


Fig. 1. Block diagram of the total mechanical and electrical
system of a pickup.


In this diagram f_g is the complex function describing the
record signal as it is acted upon by the geometric relationship
between the stylus and groove,¹ f_m is the mechanical
transfer function of the stylus system, and f_e is the electrical
transfer function of the pickup electrical system. f_m relates
the elements (arm mass, cartridge mass, stiffness, and friction)
as they affect response to mechanical input signals;
f_e relates inductance, capacitance, resistance, and generator
constants as they affect the electrical response to the pickup’s
mechanical response.


1 J. A. Pierce and F. V. Hunt, “On Distortion in Sound Reproduction
from Phonograph Records,” J. Acoust. Soc. Amer., 10, 14-28
(July 1938).

Any observations made by instruments, or by ear, for that
matter, must take into account all three functions and make
some effort, at least, to determine their separate effects on
observed response, since the output is a function of the original
signal operated on by all three, by f_g, f_m, and f_e. Mathematically,
we can describe these functions in any convenient
form. Methods for handling f_g are described in the Pierce
and Hunt paper. f_m and f_e must be set up according to the
particular type of pickup being studied. For quantitative
results, the constants of each particular pickup must be used
in the equations that apply to the particular kind of pickup.
The equations for crystal pickups differ from those of mag- 
netic design, various magnetic designs differ in details, etc. 
The point is that any critical observation upon which con- 
clusions are to be based must necessarily be guided by a 
clear grasp of the innermost workings of all the applicable 
functions. Measurements made and interpreted without 
benefit of such analytical guides must be regarded with sus- 
picion. 
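
The way the three functions combine can be illustrated with a small numerical sketch. Everything specific here is an assumption made for illustration: the geometric function is taken as ideal, the mechanical function as a simple second-order response, and the electrical function as a single-pole roll-off, with hypothetical corner frequencies. None of these forms or constants comes from the Fluxvalve design; the point is only that the measurable output is the product of the three.

```python
import math


def f_g(freq_hz):
    """Geometric (stylus-groove) function; assumed ideal here."""
    return complex(1.0, 0.0)


def f_m(freq_hz, resonance_hz=25_000.0, q=1.0):
    """Mechanical transfer function; assumed second-order form."""
    x = freq_hz / resonance_hz
    return 1.0 / complex(1.0 - x * x, x / q)


def f_e(freq_hz, corner_hz=40_000.0):
    """Electrical transfer function; assumed single-pole roll-off."""
    return 1.0 / complex(1.0, freq_hz / corner_hz)


def pickup_response(freq_hz):
    """What an instrument actually observes: the cascade of all three."""
    return f_g(freq_hz) * f_m(freq_hz) * f_e(freq_hz)


def to_db(h):
    """Magnitude of a complex response in decibels."""
    return 20.0 * math.log10(abs(h))


response_db = {f: to_db(pickup_response(f)) for f in (20.0, 1_000.0, 20_000.0)}
```

Because only the product is observed, a dip or peak in `response_db` cannot by itself be assigned to any one of the three functions, which is why the separate analytical models are needed to interpret a measurement.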

Measurements. After analyzing our proposed design in
terms of the functions f_g, f_m, and f_e, the next step is the very
difficult one of deciding upon means to measure performance
accurately. Measuring transducer performance is always a
problem, since the input and output energy forms are different.
Techniques for calibrating amplifiers are fairly well
standardized, but methods for rating speakers and pickups
vary widely throughout the audio industry.


Let me explain something about pickup measurements and
why they are necessarily hazardous. To begin with, let us
consider our function f_g, where g stands for geometry, as
mentioned before. The mechanical energy source for the
transducer is the phonograph record, a product of 50 or more
years of painstaking and sometimes painful development and
progress. Since pickups are intended for use with records, it
seems logical to devise methods of test which utilize the
so-called “test records.” Many commercial pressings are
available on the market, intended to simulate all sorts of
conditions, both real and unreal, that are imagined to exist
in music records. However, the complexity of the function
f_g, reflecting the complex interaction of stylus and groove,
makes great analytical care a necessity in interpreting measurements
made with test records, whether of the “real” or
“unreal” type. A few examples of the deceptive results that
can be obtained with various test records will serve to demonstrate
the point.

Simple constant-velocity frequency records cut at too high
an amplitude, at too high a frequency, at too slow a speed,
or with too small a diameter, will play tricks with any pickup,
as shown in Figs. 2, 3, 4, and 5.


Fig. 2. High-end “roll-off” displayed by a simple constant-
velocity frequency record, played back with a “flat” pickup.


It is clear from these examples that the question which 
always plagues us, in the absence of a reference, is, what 
are we measuring—the record, the pickup, or both? The 
flat pickup measures the geometric tracing loss due to a poor 
fit between stylus and groove. The “peaked” pickup com- 
pensates for this loss. Is this good or bad? In my opinion 
it is bad, but the point has been argued. 
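
The four record conditions named earlier (too high a level, too high a frequency, too slow a speed, too small a diameter) all act through one quantity, the sharpest bend in the groove. As a first-order sketch only: for a sinusoidal groove with peak velocity v at frequency f, moving past the stylus at linear speed V, the amplitude is v/(2πf) and the wavelength is V/f, so the minimum radius of curvature is V²/(2πfv). The comparison with the stylus radius below is a simplified plane-geometry check; the true contact geometry is the subject of the Pierce and Hunt paper. The groove radius, level, and frequency used are assumed example values.

```python
import math


def groove_speed_m_s(groove_radius_m, rpm):
    """Linear speed of the groove past the stylus."""
    return 2.0 * math.pi * groove_radius_m * rpm / 60.0


def min_radius_of_curvature_m(peak_velocity_m_s, freq_hz, groove_speed):
    """Sharpest bend of a sinusoidal groove: V**2 / (2*pi*f*v)."""
    return groove_speed ** 2 / (2.0 * math.pi * freq_hz * peak_velocity_m_s)


# Assumed example: 33 1/3 rpm, 60 mm groove radius, 15 kc at 5 cm/s peak.
speed = groove_speed_m_s(0.060, 100.0 / 3.0)
rho = min_radius_of_curvature_m(0.05, 15_000.0, speed)

stylus_radius_m = 0.001 * 0.0254   # a 0.001-in. stylus, in meters
traceable = rho >= stylus_radius_m  # first-order check only
```

Raising the level or frequency shrinks `rho` directly, while a slower speed or smaller diameter shrinks it through the square of the groove speed, so inner grooves are affected most.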

Frequency records with the RIAA² or NARTB³ high-frequency
characteristic can be even more difficult to cope
with, and some popular test records I know of are simply
impossible geometrically. Unfortunately, however, to make
matters really confusing, a pickup with poor frequency characteristics
will often look good when tested with one of these
records and the truly flat pickup will appear to fail miserably.
Figures 6 and 7 illustrate this point.


Fig. 3. An over-damped wide-range pickup or one with limited
response will show even more “roll-off.”

Fig. 4. Two “wrongs” can make a “right” if the over-cut
record is played with a pickup having high-frequency “gain.”


2 Record Industry Association of America.
3 National Association of Radio and Television Broadcasters.


Fortunately, there are a few good frequency records and 
with great care in using them and by avoiding wear, repro- 
ducible measurements can be made up to 20 kc. However, 
if the pickup has any serious peak in the operating range, 
the 20-kc calibration is quickly lost because of wear, and the 
record is no longer a precision tool. 

Phase distortion is highly important but almost unmeas- 
urable, and an adequate description of the subject is beyond 
the scope of this paper. 


Fig. 5. A pickup with an excessive high-frequency peak will exhibit
a response characteristic similar to the curve above.

Fig. 6. Properly traced, a sweep record should look like this. A
pickup peaked in the right part of the spectrum will compensate
for tracing losses and produce the proper curve, even though the
pickup is not “flat.”


Tests for harmonic and intermodulation distortion are of 
the greatest value in the pickup-design laboratory. Never- 
theless, testing concepts with respect to both these kinds of 
distortion are undergoing radical changes in the light of 
modern recording practice—radical to the extent that the 
new tests resemble the old in name only. 


Fig. 7. A flat pickup, however, will show the tracing distortion
that actually exists and, in the absence of more complete laboratory
data, will appear defective.


I have tried to make it clear that when frequency measurements
are used as a guide to design, performance evaluation
or product control, a great deal of judgment must be exercised
in order to reach realistic conclusions; this was illustrated
by several examples. Except for one or two commercially
available pressings, our work has required that we
make our own test records in order to carry out those measurements
which we regard as significant.

Measurement records cut at selected levels and frequencies
have enabled us to control the signal to a pickup in
such a way as to eliminate some of the variables in f_g and
thus measure more closely the particular pickup variable
under study at any given moment.

Since the pickup’s output is a complex product of three
different, but actually inseparable functions, as already
noted, a variation in measured output may be ascribed to
one or several causes, depending upon the amount of data,
information, or experience possessed by the observer. Thus,
when we eliminated variables in f_g by carefully controlling
the signal at particular frequencies, we were able to correlate
frequency-response studies closely with our theory. In other
words, our test methods and analytical mathematics were
complementary and reinforced each other.

As a result of our work along these lines, we have been 
able to learn a great deal about the important variables, how 
they are related to one another, and about how they affect 
the performance characteristics specified earlier. Frequency- 
response studies have been more revealing than any others 
we have conducted. It has been quite conclusively shown 
that record and stylus wear are directly related to the fre- 
quency-response characteristics of a pickup. It has also been 
proven that harmonic and intermodulation distortion in a 
particular pickup can be directly related to its frequency 
characteristics. 

Stability and reliability are, to a large extent, determined 
by mechanical design. It so happens, however, that in the 
case of the Fluxvalve pickup, improved frequency charac- 
teristics also “give an assist” to these virtues. 

It has been very clearly shown by our combined mathe- 
matical and laboratory studies that extended frequency re- 
sponse—at both ends of the spectrum—will result in lower 
stylus impedance and, therefore, in less record and stylus 
wear. It can also be demonstrated that if the pickup response 
is “flat” over its extended range, then record and stylus wear 
and all forms of distortion are reduced. One is forced to 
conclude that there is no choice but to specify “flat” re- 
sponse over the entire recorded spectrum—plus a margin 
for safety—and to provide for future progress. Today, this 
means response from 20 cps to over 20 kc. 

Any arguments about the limits of human hearing or the 
amount of music above 10 kc are completely irrelevant to 
the issue. The point is that wide-range recordings require 
pickups with mechanically flat response (20 cps to 20 kc 
plus) in order to keep wear and distortion to a minimum. 
Further, we have every reason to believe that loudspeakers 
will continue to improve and thus establish the absolute 
necessity for this kind of performance. 


3. DESIGN 


The foregoing analysis, plus field experience, pretty well 
dictated the specifications for a new kind of pickup. These 
requirements, including specifications we have already out- 
lined in the first part of this paper, are rather simply stated 
as follows: 


a. High output.

b. Low tracking force.

c. Turnover feature for two stylus sizes.

d. Easily replaceable styli.

e. Good temperature, humidity, and fungus characteristics
for all climatic service conditions.

f. Universal mounting for use in existing equipment.

g. Rugged mechanical strength.

h. Maximum freedom from clogging.

i. Maximum stylus life.

j. Minimum record wear.

k. Uniformity among pickups.

l. Good stability, particularly in the 10- to 20-kc range.

m. Light weight.

And, of course, last and most important:

n. Low distortion.

Once again, low distortion includes minimum frequency 
distortion. This means a response curve like that of Fig. 4. 
It also means low harmonic and intermodulation distortion 
measured at levels one can reasonably expect in modern 
recordings. As with frequency tests, our experience has 
shown that special records and techniques are required to 
obtain sensible measurements and these, in turn, must be 
interpreted. IM measurements of pickups are something 
quite different from those of amplifiers—not in theory, of 
course—but in practice. To be thorough, phase distortion, 
along with waveform distortion, should be considered, but 
these subjects are too extensive to be covered adequately 
in this paper. 

Once the design specifications had been firmly established, 
it became necessary to consider how to meet them. It was 
obvious that a completely new approach had to be developed. 
Existing designs of all kinds were studied; this included 
examination of both amplitude- and velocity-type pickups. 
The advantages and shortcomings of each were evaluated 
and it was concluded that the moving-iron magnetic type 
had a great many favorable attributes and no serious disadvantages.
It must be pointed out that once it had been
decided that a new departure was in order, any type or kind
of generating system could have been selected.

Several important reasons for choosing the moving-iron
type are:

a. It offers a virtually mass-less generating system.

b. There is a wide choice of output impedances and levels.

c. High output is available if desired.

d. The moving system can be extremely small.

e. It meets most of the specifications and requirements set
forth above.

It should be understood that the principal objective was 
to develop a design that had the minimum practical stylus 
impedance while meeting the other requirements insofar as 
possible. 

Once the moving-iron principle had been chosen as
the best avenue of approach, all manner of designs were
explored and evaluated. The final result of this effort was
a pickup that embodied literally all of the desirable features 
originally specified. This new product is shown in Fig. 8. 
The upper side elevation displays the styli in place and two 
extra separable replacement elements are shown in the lower 
views. Figure 9 is a three-dimensional representation of the 
pickup body, showing the magnetic-circuit structure. Figure 
10 depicts the stylus three-dimensionally. The number of 
parts has been kept to a minimum and the molded plastic 
body not only completely seals the electrical system, but 
holds all the parts rigidly together. 


4. PRODUCT EVALUATION 


About a year has elapsed since the first of these new units
was put into service. During the ensuing period some 30,000
pickups have gone into the field and well over 60,000
stylus assemblies, two per pickup plus replacements.


Fig. 8. The Fluxvalve pickup. The upper side elevation displays
the styli in place. Two extra replacement elements are shown below.

Fig. 9. The Fluxvalve pickup body, showing the magnetic-circuit structure.

Most of the installations have been in automatic systems 
where equipment frequently runs on a continuous basis. 
Service conditions range from the Arctic to the Tropics. And 
submarine installations have demonstrated that the Flux- 
valve will actually “play under water”! 

Actual performance data reported back from the field has 
established that record and stylus life has been increased 
by at least a factor of two. In one of our laboratory life 
tests with sapphire styli, 10 45-rpm music records of ordi- 
nary commercial quality were played 2500 times per side 
for a total of 25,000 plays per stylus. At the end of the 
100-day continuous test period, the styli were examined and 
found to have flats of between 1 and 2 mils. The records 
which had gone the entire 100 days without the protection 
of an enclosure were still in fair condition even though they 
had gone many hundreds of hours with worn styli in an 
unprotected atmosphere. 

A review of the design specifications and a “run-down” of
how the Fluxvalve has checked out will serve to conclude
the evaluation process.


Fig. 10. The Fluxvalve stylus. 


Requirements                                Compliance with Requirements

High output.                                25 mv at 10 cm per second.

Low tracking force.                         2 to 6 grams, depending on recorded
                                            levels and the arm used.

Turnover feature for two stylus sizes.      Turnover type with
                                              0.001-in. radius diamond stylus
                                              0.0027-in. radius diamond stylus
                                              0.0027-in. radius sapphire stylus
                                              0.0005-in. radius diamond stylus

Good temperature, humidity, and fungus      Cartridge body is hermetically sealed
characteristics for all climatic            and completely encapsulated in a tough
service conditions.                         inert plastic. It is unaffected by
                                            climatic conditions.

Easily replaceable styli.                   Styli easily replaceable without the
                                            use of tools.

Universal mounting for use in existing      Mounting dimensions in accordance with
equipment.                                  RETMA standards.

Rugged mechanical strength.                 Sturdy construction for high degree of
                                            durability.

Maximum freedom from clogging.              No perceptible clogging effect as dust
                                            and dirt collected are in shear.

Maximum stylus life.                        Low moving mass, high stylus compliance.
Minimum record wear.                        Low tracking force, and absence of any
                                            resonance insure long stylus and
                                            record life.

Uniformity among pickups.                   Output and frequency response within 2 db.
Good stability, particularly in the         Flat response from 20 cps to 20,000 cps
10- to 20-kc range.                         with no resonances.

Low distortion.                             Frequency response and waveform
                                            distortion values are of negligible
                                            magnitude as compared with older type
                                            pickups.

Light weight.                               Total weight of pickup, styli, and
                                            hardware 10 grams.
Some Experiences with Mass-Production Tape Duplicating*


JANUARY 1956, VOLUME 4, NUMBER 1


Cecil S. Bidlack†


National Association of Educational Broadcasters, Urbana, Illinois


THE NAEB Radio Network is a cooperative organization
operated by the National Association of Educational 
Broadcasters for its member stations which are owned by 
educational institutions, municipalities, and other public 
service agencies. Since its beginning in 1950, it has grown 
to the point where it is now serving nearly 100 member 
stations. It provides a program service of up to 9 hours 
per week distributed on magnetic tape. By the use of 2 
mass duplicators, each of which will produce 11 simul- 
taneous copies of a master tape, the Network turns out up- 
wards of 1200 tapes per week for shipment to its members. 
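
The figures just given imply a definite duplicating workload, which can be checked with a line or two of arithmetic. This is only a sketch: it assumes every pass fills all 11 copy positions on both machines and that no copies are rejected, neither of which is stated in the text.

```python
import math

DUPLICATORS = 2          # mass duplicators in service
COPIES_PER_PASS = 11     # simultaneous copies per machine per pass
TAPES_PER_WEEK = 1200    # "upwards of 1200 tapes per week"

# Copies produced when both machines each complete one pass.
copies_per_round = DUPLICATORS * COPIES_PER_PASS

# Passes each machine must run per week to reach the weekly total.
passes_per_machine = math.ceil(TAPES_PER_WEEK / copies_per_round)
```

Under these assumptions each machine runs about 55 passes a week; partial loads or rejected copies would raise that figure.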


BACKGROUND AND HISTORY OF THE NAEB


We shall discuss the operation of the NAEB Radio Net- 
work below; first, perhaps a bit of history and background 
will be helpful. 

The National Association of Educational Broadcasters is 
a national professional association which represents the 
interests and needs of educational radio and television 
broadcasters. For the noncommercial educational broadcaster,
the NAEB performs services similar to those rendered
by the NARTB (National Association of Radio and
Television Broadcasters) to commercial broadcasters.

One of the important services of the NAEB is the opera- 
tion of the NAEB Radio Network, which is an organization 
of 72 noncommercial radio stations owned and operated by 
active members of NAEB. The Network is directly re- 
sponsible to its members and their accredited representatives. 
Its programs are selected by committees appointed by 
the president of the NAEB. The programs are then dupli- 
cated and distributed to Network member stations by a 
staff employed by the Association. 


* Received October 2, 1955. Presented at the Seventh Annual Convention
of the Audio Engineering Society, New York, October 12-15,
1955.

† Television Engineer, Engineering Service.

Over 50% of NAEB Network programs are produced by
Network members. Other organizations, such as the Brit- 
ish Broadcasting Corporation, Radiodiffusion Television 
Francaise, Canadian Broadcasting Corporation, The Com- 
mittee for Free Asia, The Air Training Command, Cooper 
Union, and others supply the balance of its offerings. The 
NAEB has administered grants from the Fund for Adult 
Education and the Educational Television and Radio Cen- 
ter totalling $240,000 in the past four years to enable NAEB 
members and other educational organizations to produce out- 
standing educational programs. 

NAEB Network programs are primarily content oriented, 
although technical and format considerations are by no 
means forgotten. They differ from commercial programs 
in that they carry no advertising. They emphasize informa- 
tion and culture rather than entertainment, and include 
music, lectures, panel discussions, children’s programs, inter- 
views, drama and in-school programs. The Network supplies 
at least 8 hours of programming per week and, in addition, 
5 to 12 in-school series are distributed for use during each 
semester. 

How did this all start? Back in January, 1950, Seymour 
N. Siegel of WNYC, New York City’s municipal station, was 
deeply impressed by a series of programs that WNYC had 
broadcast called “We Human Beings,” which was produced 
by the Lowell Institute. He thought other noncommercial 
stations might also like to put them on the air. The response 
to his inquiry was so great that within six months there 
were 35 stations using the service. Soon the problem of 
finances, personnel, and time involved became too great for 
the WNYC staff alone and the Division of Communications 
at the University of Illinois offered to assume custodianship 
of the Network. 

Through a grant from the W. K. Kellogg Foundation, the 
tape network and also NAEB Headquarters were both made 
possible. In January of 1951 the headquarters of the Net- 
work moved to the University of Illinois campus in Gregory 
Hall where it is still located. A mass duplicator (Fig. 1) 
was purchased—one of the earliest designed and built by 
L. S. Toogood of Chicago and Francis Rawdon Smith of 
Washington. Late in 1953 a second duplicator was pur- 
chased and installed, at which time the first machine was 
given a thorough overhaul after over 2000 hours of service. 


THE DUPLICATING FACILITIES 


The Multitape 12-Channel Duplicator consists of a com- 
mon vertical-mandrel tape-handling unit (Fig. 2), with a 
16-in. flywheel coupled by multiple belts to a constant-speed 
motor. One of these channels is the playback channel, 
which leaves 11 channels available for duplicating. The unit 
accommodates RTMA standard tape reels up to 7 in. in 
diameter. A drive-motor control switch with overload re- 
lease, elapsed-time meter, and master tape rewind mechan- 
ism are incorporated. Special attention has been paid to 
ease of loading and unloading the machine. With practice, 
this operation becomes very speedy so that it is possible to 
duplicate 44 tapes, each 700 ft long, per hour. 


Fig. 1. View of 12-Channel Multitape Duplicator. 

Fig. 2. Lower portion of the duplicator showing motor, flywheel, 
and multiple-belt arrangement. 

The mechanical unit was manufactured by L. S. Toogood 
Recording Company of Chicago, Illinois, while the elec- 
tronic equipment was made by Rawdon Smith Associates of 
Washington, D.C. 

The machine operates in the following manner: A master 
tape passes over the playback head and is driven between 
the mandrel and the rubber-tired idler. The output from 
the playback head is amplified, equalized at both the low- 
frequency and high-frequency ends, and passes to the 
record-head distribution network. A second output from 
the amplifier passes, via an L-pad attenuator, to a monitor- 
ing speaker. 


Bias Arrangements 
Bias current is generated by an RC oscillator and fed 
through a phase-splitting stage to a push-pull output 
amplifier coupled by a special transformer to the other 
“leg” of the record-head distribution network. This net- 
work operates on the bridge principle, and for this reason 
the escape of bias into the audio-frequency amplifier and, 
correspondingly, the escape of audio frequencies into the 
bias system are both negligible. No tuned filters or bias 
trap circuits are therefore necessary. 


Mechanical Arrangements 


Before duplication the master tape is rewound so that, 
during the duplication process, time is reversed and the 
beginning of the program is the last thing to be recorded on 
the duplicates. This is done in order to avoid the otherwise 
prohibitively time-consuming problem of rewinding all 
copies. 

The playback head and each of the 11 recording heads 
are equipped with separate position adjustments which per- 
mit individual setting of each head in all 3 planes. This 
feature, together with an exceptional tape-path guide, pro- 
duces a positive control of the tape path, and consequent 
stability of recording. In addition, this adjustment mech- 
anism provides for simple, accurate, and stable azimuth 
setting, so that the recorded track on all channels is precise- 
ly perpendicular to the direction of tape travel. 

Even tension on the tape unloading mechanism is main- 
tained by a special felt washer cooperating with an anodized 
aluminum surface. The dimensions and construction of 
this part of the mechanism are such that approximately 
constant release tension is maintained on all channels for 
varying tape diameters. This arises from the fact that the 
tension exerted on the tape at the release reel is a function 
of gravity, and hence the greater weight of a full reel (large 
tape diameter) results in larger frictional forces than are 
the case for a nearly empty reel (small tape diameter), 
which is obviously lighter. The machine was designed for 
use with the newest Minnesota Mining & Manufacturing 
Company heavy, professional 7-in. reel, with wide spokes, 
or with smaller diameter reels (5 in. or less) of standard 
pattern. 

The take-up mechanism is driven by a belt system from 
the mandrel. A rubber-loaded felt washer system is used 
to provide take-up tension. This system is independent of 
gravity and therefore of the type of reel employed. Dur- 
ing the early period of use of the equipment, however, take- 
up tension measurements must be taken at intervals and 
an adjustment made to bring the tension to the required 
figure. 


Equalization 


Special equalization is necessary in the duplication process, 
since duplication takes place at a tape speed very much 
higher than that at which the original recording was made. 
Duplicates are made at a speed of 21.5 ips. Both high- 
and low-end equalization are necessary, since the process is 
essentially that of simultaneous playback and record. All 
equalization for the duplication process is provided in the 
special preamplifier which is supplied. A power amplifier 
with independent monitor level control drives (a) the head 
distribution network and (b) a special monitoring speaker. 
Provision is made for setting level, on a large, illuminated 
VU meter, by means of a calibrated precision attenuator. 
Provision is also made for checking the bias current fed to 
the record heads. 

A bias oscillator-amplifier unit is mounted (with the 
amplifier described above) in the special wall cabinet (Fig. 
3). Access to the wiring of both units is secured by drop- 
ping the front panels. Rear access is provided by opening 
the front of the wall cabinet to reveal the tubes, all rear 
connections, and preset adjustments. 

The built-in equalizer is adjusted so that the frequency 
response of any duplicate tape is within ±3 db of that of the 
master tape at any frequency from 50 cps to a frequency 
corresponding to a wavelength of 1 mil on the duplicates. 
Special equalization to 0.75 mil (i.e., 10 kc at 7.5 ips) can 
be provided. A variable equalizer, shown between the bias 
unit and the amplifier, may also be used when additional 
equalization is required. 
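
The relation between recorded wavelength, tape speed, and frequency that underlies these limits can be checked with a short calculation. The following Python sketch is illustrative only; the function names are assumptions and are not part of the equipment described. It uses wavelength = tape speed / frequency, with 1 mil = 0.001 in.

```python
def wavelength_mils(frequency_cps, tape_speed_ips):
    """Recorded wavelength in mils for a given frequency and tape speed."""
    return tape_speed_ips / frequency_cps * 1000.0


def frequency_cps(wavelength_in_mils, tape_speed_ips):
    """Frequency whose recorded wavelength is the given number of mils."""
    return tape_speed_ips * 1000.0 / wavelength_in_mils


# Upper limits at the 7.5-ips playing speed of the finished duplicates.
limit_standard = frequency_cps(1.0, 7.5)    # 1-mil limit
limit_special = frequency_cps(0.75, 7.5)    # 0.75-mil limit quoted in the text

# The same 1-mil wavelength passing the heads at the 21.5-ips duplicating speed.
limit_on_duplicator = frequency_cps(1.0, 21.5)

print(limit_standard, limit_special, limit_on_duplicator)
```

On these figures the standard 1-mil limit corresponds to 7,500 cps at 7.5 ips, and the electronics must pass correspondingly higher frequencies while the tape runs at duplicating speed.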

The harmonic content of a signal passing through the pre- 
amplifier and audio amplifier does not exceed 1% rms at 
any frequency from 50 to 20,000 cps when measured at a 
level such that the harmonic distortion introduced by the 
tape is 3% rms at 1000 cps. 

Fig. 3. Wall-mounted electronic equipment used with the Multitape Duplicator. 


As stated previously, programs to be offered by the NAEB 
Network are selected by program committees. The original 
tape, containing the program to be duplicated, is used to 
make a so-called “master” tape from which the duplicates 
are made (Fig. 4). This “master” is made by playing the 
program on an Ampex 300 and feeding this into a second 
Ampex 300 which records the so-called “master” tape. Due 
to the fact that there is some variation in azimuth on tapes 
received from our various program sources, the azimuth of 
the playback head on the first machine is adjusted to give 
maximum high-frequency response, while the azimuth of 
the record head, which has been previously aligned by the 
use of a 1-mil alignment tape, remains fixed. 

We realize that this procedure produces a second-genera- 
tion tape as a “master.” However, we believe that it is 
the only way in which we can be sure we get maximum 
frequency response from the program tape. It also pro- 
vides us with a means of adding network opening and closing, 
of monitoring, timing and editing, if necessary. Since the 
program runs backwards on the duplicators and at three 
times normal speed, all that one hears on the duplicator 
monitor speaker is “monkey chatter.” By using the original 
program tape on the duplicator playback channel, we could 
eliminate one tape generation. However, due to our pro- 
cedure of playing the tape backwards on the duplicator, it 
would be very difficult to adjust duplicator playback head 
azimuth in order to secure maximum high-frequency re- 
sponse. 

Since there are no erase heads on the duplicators, all tapes 
are erased on a Cinema Engineering Company bulk-type 
eraser before they are loaded on the machines (Fig. 5). 

After the duplication run is completed, tapes are checked 
for output and quality, placed in boxes, numbered, and 
labeled. They are then placed in labeled bins for the 
appropriate Network station ready for packaging. On the 
first run of the day made on the duplicators, tapes from each 
duplicator channel are checked by playback on a Magne- 
corder. On subsequent runs, only one channel is checked 
per run; thus we get slightly more than a 9% sampling of 
the output. 
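
The figures quoted so far (11 copy channels, 44 tapes per hour, 700-ft reels, a duplicating speed of 21.5 ips, and one channel checked per run) can be tied together with a little arithmetic. The Python sketch below is an illustration under stated assumptions: it assumes each run copies one full 700-ft reel on every channel, and it treats everything other than tape running time (loading, unloading, rewinding the master) as a single "handling" figure, which the text does not break down.

```python
COPY_CHANNELS = 11            # 12 channels less the playback channel
TAPES_PER_HOUR = 44           # throughput quoted for a practiced operator
TAPE_LENGTH_FT = 700
DUPLICATING_SPEED_IPS = 21.5

# Runs needed per hour to reach the quoted throughput.
runs_per_hour = TAPES_PER_HOUR / COPY_CHANNELS

# Time the tape is actually moving during one run, in minutes.
run_minutes = TAPE_LENGTH_FT * 12 / DUPLICATING_SPEED_IPS / 60

# Whatever remains of each cycle is available for handling (assumed lumped).
cycle_minutes = 60 / runs_per_hour
handling_minutes = cycle_minutes - run_minutes

# Fraction of output checked when one channel per run is played back.
sampling_fraction = 1 / COPY_CHANNELS

print(runs_per_hour, round(run_minutes, 2), round(handling_minutes, 2))
print(f"{sampling_fraction:.1%}")
```

Under these assumptions each run occupies about six and a half minutes of tape travel out of a fifteen-minute cycle, and checking one channel in eleven gives the "slightly more than 9%" sampling mentioned above.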


TAPE BOOKKEEPING 


The operation of the NAEB Radio Network requires con- 
siderable bookkeeping. By placing duplicated programs in 
the proper station bins after the program has been run, 
labeled, and boxed (Fig. 6), we get a double check when 
these are removed from the bin for packing in fiber cartons. 
The tapes are checked off the stations list when they are 
placed in the bins and again during the packaging operation. 

Fig. 4. Making a “master” tape from program tape on Ampex 
machines. 


Tapes returned by stations to Network headquarters are 
also entered to the particular station’s credit. A new pro- 
cedure has recently been inaugurated for incoming tapes. 
As the incoming package is opened, tapes are “candled” 
for multiple or bad splices (Fig. 7), a visual inspection is 
made to see if the tapes being returned are full-length tapes, 
and the particular station is credited with the number of 
reels returned. 

Since there is no way of marking the tape itself, we do 
not necessarily get back the same tape we ship out. Experi- 
ence has taught us that unless we permit this exchange, tapes 
pile up due to differing programming schedules in individual 
stations. By permitting an exchange of tape we can operate 
with a lower tape inventory. 


Fig. 5. Erasing and unpacking tapes. 
Short Tapes 


Since a short tape may necessitate an extra run on the 
duplicator, reels found to be short are set aside and, during 
slack periods, sufficient tape is added to fill the reels. In 
the near future, stations will not receive credit for tapes 
returned if they are not full length. Tapes with many 
splices observed during the “candling” process are also set 
aside. These are inspected, multiple or poor splices are 
eliminated, and reels are filled out to proper length by the 
addition of tape. 

All tape passing this receiving inspection is then erased 
on the bulk eraser. 


Fig. 6. Placing boxed and labeled duplicates in station bins 
prior to packing for shipment. 

Fig. 7. “Candling” tape. 


IMPROPER SPLICES 


It is surprising how such a simple operation as splicing 
tape can be done in so many ways (Fig. 8). On our board of 
examples of some of the many ways not to splice tape we find 
the following: 

1. An overlapping splice. 
2. A splice made with a “band-aid.” 
3. Five splices in a 3-in. length of tape. 
4. A gap splice. 
5. A splice made on the oxide coating of the tape. 
6. A wrap-around splice. 
7. A double wrap-around splice. 
8. An overlapping and untrimmed splice. 
9. A wrap-around splice made with transparent tape. 
10. A splice made with a half twist. 

There are probably other ways in which splicing should not 
be done; however, this affords a rough idea of what we 
encounter periodically. 

Fig. 8. Ten ways in which tape should not be spliced. 


THE PROBLEM OF STICKY TAPES 


Another of the problems encountered is sticky tapes. 
And with over 8000 in circulation some of them are bound 
to show signs of wear. This past summer has been a very 
humid one in Urbana. After about two weeks of very 
humid weather our troubles really began. Tapes which, 
under normal conditions of temperature and humidity, 
might have passed through the tape guides of the duplicators 
refused to run. Some would break; others would drag so that 
music duplication was impossible and speech impaired. A 
stop-gap solution was found in the use of calcium-chloride de- 
humidifiers. The use of three of these dehumidifiers in our 
duplicator room was found by actual measurement to have 
reduced relative humidity from 87% to 57%. We now 
have a one-ton air conditioning unit so that the calcium- 
chloride dehumidifiers are no longer necessary. 


THE FACTOR OF HUMAN NATURE 


Human nature is pretty much the same regardless of 
economic position, education, or any other classification. 
We recently received a number of tapes returned on metal 
reels, as well as a number of tapes of 1951 programs. In 
spite of this and of the impossibility of any strict control by 
the Network, in 4 years of operation and 10,367 tapes 
later, we can account for 8,524. This is an attrition of 1,843 
tapes, or 17.8% during this period. Some of this loss is due 
to exchange, some tapes wear out and must be dis- 
carded, others just disappear. We hope in time to improve 
on the 1954 record; during that year 270 tapes could not be 
accounted for, or less than one per day. 


RATES 


On July 1, 1955, the NAEB Radio Network began 
its first year as a self-supported operation. As of this writ- 
ing, there are 72 stations on the Network which pay fees 
for Network service according to station power. Stations 
are classified as follows: (a) Up to 1 kw, (b) 1 to 5 kw, 
(c) 5 kw and over. The rates for these classifications are 
respectively $300.00, $500.00, and $800.00 per year. Sta- 
tions on the air the year around pay an additional 25% for 
in-school programs, or $2.00 per reel, whichever is less. 
School stations which do not operate during the summer 
receive the in-school programs at the basic rate. 
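
The rate schedule just described can be written out as a small calculation. This Python sketch is an illustration, not an official NAEB tariff. Two points are assumptions, because the text leaves them open: a station of exactly 1 kw is placed in class (a) and one of exactly 5 kw in class (c), and the 25% surcharge is taken as 25% of the station's basic rate.

```python
def annual_fee(power_kw, year_round=False, in_school_reels=0):
    """Annual Network fee in dollars under the schedule quoted in the text."""
    if power_kw <= 1:          # class (a): up to 1 kw (boundary is an assumption)
        base = 300.00
    elif power_kw < 5:         # class (b): 1 to 5 kw
        base = 500.00
    else:                      # class (c): 5 kw and over
        base = 800.00

    surcharge = 0.0
    if year_round:
        # Additional 25% or $2.00 per in-school reel, whichever is less.
        surcharge = min(0.25 * base, 2.00 * in_school_reels)
    return base + surcharge


print(annual_fee(0.25))                                      # small school station
print(annual_fee(1, year_round=True, in_school_reels=30))    # per-reel charge is less
print(annual_fee(5, year_round=True, in_school_reels=500))   # 25% cap applies
```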

Present programming on the Network will average 8 hours 
per week; programming for the fall quarter will average 
around 10 hours per week. This means that at present the 
weekly output averages around 725 tapes while, within the 
next few weeks, output will be stepped up to somewhat 
more than 900 tapes per week. 

The Network has begun a program to improve the tech- 
nical quality of its output. The half-width heads which 
were the original complement of the duplicators are being re- 
placed with full-width heads (Fig. 9). A low-distortion 
audio oscillator and a noise and distortion analyzer, recently 
acquired, will provide the means for keeping headquarters 
equipment operating at peak performance. The NAEB 
Engineering Committee is asking Network program sup- 
pliers to submit a test tape which, when analyzed, will pro- 
vide a check on the station’s recording equipment. From 
this tape we will be able to determine frequency response, 
head alignment, noise, distortion, and timing. 


CONTRIBUTIONS OF THE NAEB TO 
AMERICAN EDUCATION 


In its four years of operation the NAEB Network has 
undergone many changes. It has won many awards and 
other types of recognition. It has served as a model for 
both commercial broadcasters in America and comparable 
operations in several foreign countries. As a medium for 
the exchange of quality programs among its members, it 
serves university and college and school stations with pro- 
grams for adults and in-school use. The NAEB Network 
is enabling under-staffed and under-financed member sta- 
tions to devote more time and money to those programs 
which they do produce, thus raising the level of educational 
programs and increasing the respect which such program- 
ming enjoys in America. 
Fig. 9. Full-width recording head and used half-track head which 
it replaces. 
The Influence of Magnetic Pigment Properties 
on Coated Recording Media* 


ERNEST W. FRANCK AND EDWARD SCHMIDT 
Reeves Soundcraft Corporation, Springdale, Connecticut 


The characteristics of a new type of synthetic iron-oxide magnetic pigment are discussed in its 
application to standard sound-recording tape, motion-picture magnetic film, and to the instru- 
mentation-tape fields. Discussed, too, are factors other than magnetic properties which affect tape 
performance. 


MAGNETIC-PROPERTY measurements made of bulk 
powders can be used to predict the general electrical 
performance of tapes manufactured from them, but the pre- 
dictions will not always be accurate. This is not surprising 
if we consider the vast difference between loose bulk powder 
on the one hand and a carefully dispersed, thin layer of 
coated tape on the other. The very special kind of mag- 
netic history which the powder in a tape experiences, what 
with narrow slits, high-frequency bias, and steep flux gradi- 
ents, is not duplicated in the bulk measurements. The latter 
are useful, but they are by no means enough. The only way 
to tell accurately how a pigment will perform in a tape is 
to disperse the pigment, coat some tape, and then make 
electrical measurements of the tape. When this is done, it 
is not uncommon for the results of actual tape measure- 
ments to differ from the predictions made on the basis of 
bulk measurements. 

In this paper, we will discuss some of the factors which 
can affect the final electrical performance of a tape but which 
are not revealed by bulk measurements of magnetic prop- 
erties. 


AVERAGE PARTICLE SIZE 


Average particle size is a very important consideration 
but is hard to measure because the individual particles of 
the pigments used in magnetic recording are so small. 


* Presented at the Seventh Annual Convention of the Audio Engi- 
neering Society, New York, October 12-15, 1955. 
Electron photomicrographs provide the best means of meas- 
urement, since the particles are too minute for optical 
microscopes. Workers in this field think there is some un- 
avoidable size selection or shape selection inherent in the 
methods generally used in preparing slides of very fine pow- 
der for electron microscopy. One research worker, long 
experienced in this field, cautions that he would have to pre- 
pare 1000 slides to be reasonably sure of his data on average 
particle size and distribution in one bulk powder. 

Variation in particle size will indicate inverse variations 
in the surface area of the powder. When the powder is 
dispersed in a liquid solution of binder material, the surface 
of the powder must be made wet by the binder. The pow- 
der with smaller particle size will be found harder to wet 
because of its greater total surface area. This smaller-sized 
powder may also require a longer time to disperse. 
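
The inverse relation between particle size and surface area can be made concrete with a simplified model. The Python sketch below assumes smooth spherical particles of a single diameter, which real pigment particles are not (they are elongated and irregular), so the numbers indicate only the trend. For spheres the surface area per unit mass is 6 / (density × diameter); the density of 4.5 g/cm³ is the approximate particle density quoted later in this paper, and the diameters are hypothetical.

```python
def specific_surface_m2_per_g(diameter_microns, density_g_per_cm3=4.5):
    """Surface area per gram of uniform spheres (idealized model).

    With the diameter in microns and the density in g/cm^3, the
    expression 6 / (density * diameter) comes out directly in m^2/g.
    """
    return 6.0 / (density_g_per_cm3 * diameter_microns)


# Hypothetical particle diameters, in microns.
for d in (1.0, 0.5, 0.25):
    print(d, round(specific_surface_m2_per_g(d), 2))
```

Each halving of the particle diameter doubles the area that the binder must wet, which is consistent with the finer powder being harder to wet and slower to disperse.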


PARTICLE SIZE DISTRIBUTION 


The manufacturer of a pigment cannot make all particles 
exactly the same size. Thus, in any powder sample we ex- 
pect a range of particle sizes from small through medium 
to large. 

Bulk measurements of the magnetic properties of pow- 
ders do not give any information as to the size distribution 
of the particles. If the powder contains some relatively 
large particles, we would expect magnetic properties differ- 
ent from those of powders containing relatively small parti- 
cles. If the powder contains some large particles and some 
small ones, the bulk measurement gives only an average. 
Here, also, if there are more particles of larger size in one 
powder than in another, the total surface area will be smaller 
and the rate and degree of dispersion may be affected. 


VARIATION IN PARTICLE SHAPE 

Particle shape can be important in several ways. Shape, 
as well as size, controls the area which must be wet. In 
addition, shape determines how compactly the particles can 
be arranged within the tape coating. Uniformly perfect 
cubes, for instance, could be packed in such a way as to 
fill the available space completely. Perfect spheres of 
uniform size can never fill more than about 74% of the 
available volume (roughly 64% when packed at random), 
and irregular shapes can fill even less. 

Typical powders used for magnetic recording exhibit a 
rather high bulking factor. The density of individual parti- 
cles is approximately 4.5, but the bulk-powder density is 
generally well below 1.0. This alone is an indication of ir- 
regular particle shape. 
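
The two densities just quoted give the fraction of the bulk volume actually occupied by solid material. The Python sketch below is a simple illustration; it assumes both densities are expressed in the same units, and it uses 1.0 as an upper bound for the bulk-powder density, since the text says only that the value is "well below 1.0."

```python
def packing_fraction(bulk_density, particle_density):
    """Fraction of the bulk volume occupied by the particles themselves."""
    return bulk_density / particle_density


PARTICLE_DENSITY = 4.5     # approximate figure quoted in the text
BULK_DENSITY_LIMIT = 1.0   # the text gives this only as an upper bound

fraction = packing_fraction(BULK_DENSITY_LIMIT, PARTICLE_DENSITY)
bulking_factor = 1.0 / fraction

print(f"solid fraction below {fraction:.0%}, bulking factor above {bulking_factor:.1f}")
```

A loose powder in which less than about a fifth of the volume is solid is packed far more openly than any regular arrangement of uniform particles, which supports the inference of irregular particle shape.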

With this in mind, we may speculate concerning what 
goes on in a pigment dispersion after it has been spread on 
the plastic base and during the time it is drying by evapora- 
tion of the volatile solvents. After coating and before the 
solvent has evaporated, the wet coating is much thicker than 
its ultimate dried thickness, generally at least 5 times 
thicker. We can imagine that it resembles a large tank of 
goldfish in a pet shop. It is full of elongated particles, 
and there are about 50 layers, from top to bottom. The 
fish are crowded, but do not often touch one another. If we 
now open a valve and drain the water from the tank, we 
will see that as the level of the water gets lower, the fish are 
crowded closer together. At some time a point will be 
reached when the uppermost fish stop descending because 
all of the fish below them are in the way. There is no more 
room. This is the point where we have the maximum possi- 
ble density of fish for this particular volume. If all the 
fish were different in shape or were non-uniform in size, 
the packing density would be affected. 

It is generally found that for each powder there is a 
maximum density of packing; the shape of the particles and 
the range of particle-size variation are the principal factors 
which determine this limit. 

Binder systems are available which would give good re- 
sults with closer pigment packing, if the pigment could be 
made to pack better. Thus far, however, the best pigments 
have resisted packing. 

Some work has been done with brute-force methods of 
dispersing. Densities can be increased in this way, but it 
has been found that, in general, when closer packing is 
achieved by the brute-force method, sensitivity and output 
at the longer wavelengths are increased, but at the shorter 
wavelengths there is no improvement. This may be due to 
several possible physical causes: the breaking of some of the 
pigment particles into smaller particles, the production of 
internal strains, or the compacting together of numbers of 
particles into tightly bound groups. Furthermore, this type 
of dispersion can best be done in relatively soft binder sys- 
tems, and the finished product is likely to soften at tem- 
peratures which are only moderately high. 


ALIGNMENT 

Tape coatings generally exhibit some degree of particle 
alignment. Shape and uniformity of size have much to do 
with how closely the particles can be packed. The particles 
are generally elongated, and packing can be much greater 
when there is some alignment than with random orientation. 
A box of wooden matches offers a good analogy: It is able 
to hold the greatest number of matches when they are of 
uniform size. If the particles of a pigment are elongated 
—particularly if they are of fairly uniform size—it is to be 
expected that the magnetic properties will be affected by 
the degree of alignment present. Bulk powders have fairly 
random particle orientation; when a bulk powder is dis- 
persed and coated onto tape with some degree of particle 
alignment, the magnetic properties are different from those 
measured in the bulk unoriented condition. 

These various shape and size factors can also affect the 
rate of dispersion, which, in practice, has been found to vary. 
Other factors, of course, can influence the dispersion rate as 
well as the final performance of the tape. Surface effects 
in the powder, due to adsorbed gases or residual impurities 
resulting from the manufacturing process, can greatly affect 
the dispersion cycle. 

The problem of determining when a powder has had op- 
timal dispersing is not simple, because the degree of dis- 
persion required for magnetic coatings which are to be used 
in sound recording is greater than that required in the 
protective-finishes field. The usual technique of determin- 
ing fineness of dispersion involves the use of a fineness gauge 
and microscopic examination. With typical magnetic dis- 
persions, these techniques are not adequate. Generally, addi- 
tional electrical improvement results when the dispersion 
process is carried beyond the point at which the aforemen- 
tioned measuring techniques have reached their limits of 
resolution. This means that to find optimum dispersing 
time for a given powder, it is necessary to run a series of 
tests with varied cycling, and then to choose the desired 
dispersion time on the basis of electrical measurements of 
test coatings. It is not enough merely to run all tests for 
prolonged periods, because some pigments have been found 
to be injured when milled past the optimum point; it has also 
been found that response at the shorter wavelengths suffers. 
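
The selection procedure described here, a series of test coatings judged by electrical measurement, can be sketched in a few lines of Python. The milling times and output figures below are hypothetical values invented for illustration; they are not data from the authors, and the choice of short-wavelength output as the deciding measurement is likewise an assumption.

```python
# Hypothetical test series: (milling time in hours,
# relative short-wavelength output of the test coating in db).
test_coatings = [
    (8, -3.0),
    (16, -1.2),
    (24, 0.0),
    (32, -0.4),   # output falls again: pigment milled past its optimum
    (48, -1.5),
]


def optimum_dispersion_time(measurements):
    """Return the milling time whose test coating measured best."""
    best_time, _ = max(measurements, key=lambda pair: pair[1])
    return best_time


best = optimum_dispersion_time(test_coatings)
print(best)
```

Because the output falls on both sides of the optimum, running every test for a prolonged period would miss it; the series has to bracket the peak.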

Some of the necessary electrical tests of tapes involve 
measurements at very low levels; included in this group are 
tests for completeness of erasure and print-through, or layer- 
to-layer transfer. The print-through test involves a time 
factor which cannot be included in tests of bulk powder. 
Considering the number of potential variables, surprises are 
to be expected. 

A case in point is the experience of the authors with a 
new powder material. Bulk tests did not show this new 
product to be different in any way from the conventional 
materials, but the early dispersion tests gave coatings which 
showed better sensitivity at short wavelengths. The pow- 
der, having higher solvent demand, behaved differently in 
our dispersing system and required extended milling cycles. 

We finally were satisfied that we had an improved prod- 
uct for use in striping film—a material possessing better 
short-wavelength sensitivity and better noise characteristics. 
Samples were sent to several of the motion-picture studios 
for field evaluation on CinemaScope equipment. 

Reports from Twentieth-Century Fox, Metro-Goldwyn- 
Mayer, Warner Brothers, and Universal were in agreement 
to the effect that the electrical characteristics of the new 
oxide were the same—insofar as bias requirements, sensi- 
tivity at 1000 and 7000 cps, and distortion were concerned 
—as the electrical characteristics of the materials they had 
been accustomed to using. The noise measurements reported 
by those participating in the tests indicated an improve- 
ment for the new oxide of 1.5 to 3 db. These field data 
correspond closely to the authors’ own measurements. All 
the motion-picture people commented that the actually ob- 
served noise improvement, as determined by listening tests 
in their review rooms, was considerably better than the im- 
provement which they thought should have been noted 
purely on the basis of their electrical measurements per se. 
Also, the noise advantage for the new dispersion was re- 
ported to be as much as 10 db better than that for the old 
dispersion. The review-room inspectors found a correspond- 
ing “increase in intelligibility” and a “silkier” characteristic. 

The reasons for these observed differences between meas- 
ured noise levels and those indicated by listening tests have 
not yet been determined. It is not a difference in spectral 
noise distribution. 

Intensive investigations of the noise functions of magnetic- 
recording pigments and processes have been triggered by 
the above data. We are confident that significant improve- 
ments in the noise functions of all magnetic-recording media 
are now feasible and that it is practical to take commercial 
steps in the direction of reduced noise on standard recording 
tape, as well as on magnetic film materials. 


CONCLUSIONS 

Bulk measurements of the magnetic properties of powders 
are useful when there is no great variation in particle size, 
shape, or distribution, and where no significant chemical and 
surface-area conditions exist. The accurate evaluation of a 
powder for sound-recording purposes demands that the pow- 
der be dispersed and a coating made for tests. Further, fac- 
tors such as print-through and the noise characteristics of 
particular powders cannot be predicted by present bulk- 
measurement techniques. 
A Direct-Reading B-H Meter 


ANDREAS KRAMER 


Audio Devices, Incorporated, New York, New York 


In the manufacture of magnetic iron oxide for recording tape, frequent tests must be made 
in order to maintain uniformity of the product. 

The simplest and quickest method has hitherto been the use of a B-H loop tracer, where the 
hysteresis loop is traced on the screen of a cathode-ray tube. This involves centering the pattern 
and reading the results on a previously calibrated screen and requires time and experience. 

This paper describes a method of testing magnetic materials, particularly magnetic iron oxide, 
where the most important properties, i.e., Bm, Br, Hm, and Hc, are indicated directly on a meter 
with a calibrated scale. 


INTRODUCTION 


THE QUALITY of magnetic-recording tape is determined 
in great part by the characteristics and uniformity of 
the iron oxide with which it is coated. In the course of an 
extensive oxide research program it became evident that 
the work would be greatly accelerated if we could make 
rapid and accurate measurements of the magnetic proper- 
ties of oxides which affect recording characteristics. 

The art of testing magnetic substances is very old. The 
instruments used for testing vary, depending on the size, 
shape, and kind of material to be tested. Materials in a 
solid form (such as laminations and toroidal cores) with 
high permeability and low coercive force, may be tested 
accurately with relatively simple equipment. Only a small 
number of ampere turns are required for saturation and 
the output from the test coil is high. 

The testing of high-coercive, low-permeability materials 
is more difficult. A strong magnetizing field is required and 
the output from the test coil is low. The air flux voltage 
from the test coil is very high and must be balanced out in 
order to get a satisfactory reading of the voltage induced 
by the sample under test. If the sample is in powder form, 
the test is further complicated, usually with a resulting loss 
in accuracy. 

* Presented at the Seventh Annual Convention of the Audio Engineering 
Society, New York, October 12-15, 1955. 

The usual method of testing low-permeability iron oxides 
is that of filling a thin glass tube with the material and 
inserting it in the test-coil assembly. The induced voltage 
is then integrated, amplified, and applied to an oscilloscope, 
which shows the hysteresis loop for the sample. Some 
magnetic properties may be read off directly, while others 
may be deduced from several readings at various field in- 
tensities. However, the reading of an oscilloscope is tedious, 
tiring, and often difficult to do accurately: the pattern tends 
to drift and the readings vary among operators because of 
line thickness and parallax. 

It was evident to us that direct meter readings of mag- 
netic properties would be of help to production and quality- 
control personnel. Because of this, the present instrument 
was developed; it permits the more important properties of 
magnetic iron oxides to be read directly on appropriate 
meters. 

The B-H meter may be divided into two major sections: 
the section comprising the coil assembly and that comprising 
the electronic components. The latter may be separated into 
the φ, or flux, amplifier and the H, or magnetizing-force, 
amplifier. Each of these is followed by meter circuits with 
their associated gates and gate-pulse generators. 


COIL ASSEMBLY OF THE B-H METER 
The coil assembly is made up of two sets of coils. The 
first set consists of a magnetizing coil (the exciting coil) with 
a test coil (the pickup coil) located at its center, as shown 
in Fig. 1. The second set is made up of the balance coils, 
whose purpose is to cancel the air flux voltage of the test 
coil. 
Fig. 1. Exciting coil and wiring diagram. 


The exciting coil has approximately 2000 turns of 
heavy wire and the following approximate dimensions: 
length, 15 cm; outside diameter, 8 cm; inside diameter, 2.5 
cm. 

The equation for the magnetizing force at the center of 
the exciting coil is 

$$H_p = 239\,I_{\mathrm{rms}}$$

where I_rms is the exciting current in amperes and H_p is the 
peak magnetizing force in oersteds. For precision tests, 
the coil is short, but the field intensity is constant within 
1% over a range of 4 cm at the center. This is nearly twice 
the length of the test coil. 
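
As a rough plausibility check on the coil constant, the long-solenoid approximation may be applied to the winding data given above. The sketch below assumes an ideal, infinitely long, thin solenoid (which the actual short, thick coil is not), uses the approximate figures of 2000 turns and 15 cm, and takes a constant of 239 oersteds peak per rms ampere, as implied by the figure of 4.18 amps quoted later for 1000 oersteds. The function names are illustrative only.

```python
import math

TURNS = 2000            # approximate turns on the exciting coil (from the text)
LENGTH_CM = 15.0        # approximate winding length in cm (from the text)
COIL_CONSTANT = 239.0   # Oe peak per A rms, implied by 1000 Oe at 4.18 A


def long_solenoid_peak_oersteds(i_rms, turns=TURNS, length_cm=LENGTH_CM):
    """Peak field of an ideal long solenoid: H = 0.4*pi*N*I_peak/l (Oe, A, cm)."""
    i_peak = math.sqrt(2.0) * i_rms
    return 0.4 * math.pi * turns * i_peak / length_cm


def coil_equation_peak_oersteds(i_rms):
    """Peak field from the coil constant assumed above."""
    return COIL_CONSTANT * i_rms


if __name__ == "__main__":
    print(f"ideal long solenoid: {long_solenoid_peak_oersteds(1.0):.0f} Oe per rms ampere")
    print(f"coil constant:       {coil_equation_peak_oersteds(1.0):.0f} Oe per rms ampere")
```

The idealized figure comes out near 237 oersteds per rms ampere. The closeness to 239 should be read only as confirming the order of magnitude, since the approximation ignores the finite length and winding depth of the coil.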

The test coil has 5000 turns of fine wire. It is layer 
wound, on a linen-base phenolic tube which has an outside 
diameter of 1 cm and an inside diameter of 0.6 cm. 

The primary winding of the balance coil has approxi- 
mately 1500 turns of heavy wire. This coil is only 10 cm 
long, and consequently it has a larger outside diameter than 
the exciting coil. These dimensions give a slightly greater 
field strength than that of the magnetizing coil for the same 
current. This allows the balance coil’s secondary to be 
adjusted positionally so that it will have a voltage induced 
in it equal to that of the test coil. 

The secondary winding of the balance-coil assembly, in- 
cidentally, is similar to the test coil, but is mounted in such 
a way that it can move axially inside the primary winding 
for the balance adjustment. 


Coil Connections 


The right-hand half of Fig. 1 shows how the coil assembly 
is wired. The magnetizing coil and primary of the balance 
coil are both connected in series with a 40-mfd capacitor, 
an ammeter, and two 0.5-ohm resistors. (A conventional 
ammeter was used in the model. This will be replaced with 
a multiscale peak-reading voltmeter for more accurate 
readings at low values of magnetizing force.) The output 
voltage from the two 0.5-ohm resistors is proportional to 
the current in the coils and hence proportional to the 
magnetizing force in the exciting coil. The range-selector switch 
is connected to the input of the H amplifier. The relative 
position of the two coil sets may be seen at the left-hand side 
of Fig. 6. 

The test coil and the secondary winding of the balance 
coil are connected in series opposing, so that when proper 
adjustments have been made, the output is but a few 
millivolts with no sample in the test coil. The coils are 
connected to the input of the φ amplifier. 


THE AMPLIFIERS 


The input voltage to the H amplifier may be found from 
the coil equation. For an exciting field of 1000 oersteds the 
current in the 0.5-ohm resistors is 4.18 amps and the input 
voltage is then 2.09 volts rms for the 0-400 oersted range. 
The signal is then fed simultaneously to a pair of 
voltage-amplifier stages, each of which is associated with a cathode 
follower. Coupling capacitors have been omitted to avoid 
phase shift where possible. One amplifier has a gain of 
10 and feeds the φ_r gate-pulse generator. The other feeds the 
6AK5 tube, which operates as a controlled clamp. This is 
shown in Fig. 2. 
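
The arithmetic behind these figures can be sketched as follows. The constant of 239 oersteds peak per rms ampere is taken here as an assumption consistent with the quoted 4.18 amps at 1000 oersteds, and the voltage is taken across a single 0.5-ohm resistor; the function names are illustrative.

```python
COIL_CONSTANT = 239.0    # Oe peak per A rms (assumed, consistent with 1000 Oe at 4.18 A)
SENSE_RESISTANCE = 0.5   # ohms, one of the two series resistors


def exciting_current_rms(h_peak_oersteds):
    """RMS coil current needed for a given peak magnetizing force."""
    return h_peak_oersteds / COIL_CONSTANT


def sense_voltage_rms(h_peak_oersteds, resistance=SENSE_RESISTANCE):
    """RMS voltage developed across the sensing resistor."""
    return exciting_current_rms(h_peak_oersteds) * resistance


if __name__ == "__main__":
    print(f"{exciting_current_rms(1000.0):.2f} A rms")
    print(f"{sense_voltage_rms(1000.0):.2f} V rms")
```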

The output voltage from the test coil is proportional to 
the rate of change of the flux φ in the sample under test. 
This output voltage has a very low rms value but peak 
readings of 12 volts have been encountered during tests. 
The integrator preceding the φ amplifier must therefore be 
able to handle these peak voltages. A separate cathode 
follower precedes and follows the electronic integrator; this 
provides a desirable high input impedance for the test coil 
and a low output impedance to the input of the φ amplifier. 
The integrator with cathode followers is shown in Fig. 3. 
A 1-mfd capacitor in series with a 10-K control in the 
cathode circuit of the input tube permits a small degree of 
phase adjustment. The φ amplifiers are similar to the H 
amplifiers, except that one additional stage with a gain of 
10 has been added. The voltage divider preceding this tube 
is the meter multiplier and is divided into steps of one-half 
full-scale readings. The reason for this arrangement is that 
the present gate-pulse generator will not operate 
satisfactorily at a voltage which reads less than 1/3 full scale on the 
H_c and φ_r meters. The input to the H_c gate-pulse generator 
is 13 to 26 volts peak and the input to the φ_r clamp is 10 to 
20 volts peak. 
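
The role of the integrator can be illustrated numerically. The sketch below is not a model of the electronic circuit; it assumes an ideal test coil obeying e = -N dφ/dt, uses the 5000 turns quoted for the test coil, and applies a simple trapezoidal running integral to a synthetic sinusoidal flux whose amplitude is chosen arbitrarily. The function names are illustrative.

```python
import math


def synthetic_coil_voltage(phi_m, freq_hz, turns, n_samples, dt):
    """Voltage induced in an ideal coil by a flux phi_m*sin(2*pi*f*t)."""
    w = 2.0 * math.pi * freq_hz
    return [-turns * phi_m * w * math.cos(w * k * dt) for k in range(n_samples)]


def integrate_coil_voltage(samples, dt, turns):
    """Recover flux (relative to the first sample) from e = -N dphi/dt."""
    flux = [0.0]
    for prev, cur in zip(samples, samples[1:]):
        # Trapezoidal step; the minus sign undoes the sign of the induction law.
        flux.append(flux[-1] - 0.5 * (prev + cur) * dt / turns)
    return flux


if __name__ == "__main__":
    dt = 1.0 / 60.0 / 2000.0            # 2000 samples per 60-cps cycle
    e = synthetic_coil_voltage(1e-6, 60.0, 5000, 2001, dt)
    phi = integrate_coil_voltage(e, dt, 5000)
    print(f"peak coil voltage:   {max(abs(v) for v in e):.3f} V")
    print(f"recovered peak flux: {max(phi):.3e} Wb")
```

The coil voltage grows with the rate of change of flux while the integrated signal follows the flux itself, which is why the stage ahead of the integrator must accept sharp peaks even though the rms value is low.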
Fig. 2. H amplifier and its associated vtvm. 


METERING CIRCUITS 


1. General. Up to this point there is very little difference 
between the B-H meter and the conventionally used B-H 
curve tracers. Jacks are provided on the front panel and 
connected to the output of the φ and H amplifiers at the 
2-mfd capacitors for test and set-up purposes. An 
oscilloscope connected at this point would show the hysteresis loop 
for the material. 

2. Measurement. The parameters measured by the 
peak-reading vtvm's are illustrated in Fig. 4. Coercive force is 
measured at an instant when the net flux passes through 
zero. Remanent flux is measured when the magnetizing force 
is zero. The loop on the left indicates that the coercive force 
of the "individual particles" is very nearly the same at the 
higher flux densities and that there is a uniform change as the 
flux density goes through zero. This yields a flat top and 
straight, sloping sides. The timing for the φ_r gate pulse, in 
this case, is not very critical. The reading for the coercive 
force would vary somewhat if the H_c gate pulse were 
advanced or retarded. 


The hysteresis loop on the right, in Fig. 4, is very different. 
It indicates that the material has a great spread in coercive 
force or that the domains are unfavorably oriented and must 
be rotated by the magnetizing force. For the measurement 
of this material the timing is very critical and must be kept 
correct to a small fraction of a millisecond. 
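
The two sampling instants described above can be sketched on sampled data. The example assumes, purely for illustration, an elliptical loop in which the flux lags a sinusoidal magnetizing force by a fixed angle; real oxide loops are not ellipses. Coercive force is read from H where the flux passes through zero in the positive direction, and remanent flux is read from φ where H passes through zero in the negative direction. All names are hypothetical.

```python
import math


def synthetic_loop(h_m, phi_m, lag_rad, n=1000):
    """One cycle of H and of a flux lagging H by lag_rad (an idealized loop)."""
    h = [h_m * math.sin(2.0 * math.pi * k / n) for k in range(n + 1)]
    phi = [phi_m * math.sin(2.0 * math.pi * k / n - lag_rad) for k in range(n + 1)]
    return h, phi


def crossing_values(gate, other, rising):
    """Values of `other`, linearly interpolated, where `gate` crosses zero."""
    out = []
    for i in range(len(gate) - 1):
        a, b = gate[i], gate[i + 1]
        if (rising and a < 0.0 <= b) or ((not rising) and a > 0.0 >= b):
            frac = a / (a - b)   # fraction of the sample interval to the crossing
            out.append(other[i] + frac * (other[i + 1] - other[i]))
    return out


if __name__ == "__main__":
    h, phi = synthetic_loop(1000.0, 1.0, math.pi / 6.0)
    h_c = crossing_values(phi, h, rising=True)      # flux rising through zero
    phi_r = crossing_values(h, phi, rising=False)   # H falling through zero
    print("coercive force:", [round(v, 1) for v in h_c])
    print("remanent flux: ", [round(v, 3) for v in phi_r])
```

Shifting either crossing by even one sample changes the value read from a steeply sloping part of the loop, which is the sensitivity to timing that the text describes.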

3. Gate-Pulse Generators. The φ_r and H_c gate generators 
are similar except that the H_c gate requires a voltage 180° 
out of phase with the φ_r voltage, since the polarity of the 
H_c gate must be negative in order to operate the clamp. The 
H_c gate is obtained from the φ amplifier as the flux voltage 
passes through zero in the positive direction. The φ_r gate is 
generated by the H amplifier voltage as it goes through zero 
in the negative direction. A voltage of about 12 to 25 
volts is first clipped by a pair of biased diodes and then 
differentiated and amplified. Clipping occurs at about ±0.5 
volt. The ratio of this voltage to the peak voltage is the 
sine of the angle through which it goes. Hence, the duration 
of the voltage change, for 60 cps, is the ratio of this 
angle to 360°, multiplied by 16.6 millisec. This time interval 
varies between 0.1 and 0.22 millisec. After differentiation 
in a 25 microsec circuit, the resulting voltage is amplified 
and applied to a 6AK5 clamp. 
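
The quoted range of transition times can be checked with a short calculation. The sketch assumes a pure 60-cps sine wave and clip levels placed symmetrically at about 0.5 volt on either side of zero, so that the wave traverses a total of 1 volt; the function name is illustrative.

```python
import math

LINE_PERIOD_MS = 1000.0 / 60.0   # one cycle at 60 cps, about 16.7 millisec


def clip_transition_ms(peak_volts, clip_volts=0.5, period_ms=LINE_PERIOD_MS):
    """Time for a sine wave to swing between +clip_volts and -clip_volts."""
    # Angle on each side of the zero crossing is asin(clip/peak).
    angle = 2.0 * math.asin(clip_volts / peak_volts)
    return angle / (2.0 * math.pi) * period_ms


if __name__ == "__main__":
    for peak in (12.0, 25.0):
        print(f"{peak:4.0f} V peak: {clip_transition_ms(peak):.3f} millisec")
```

Under these assumptions a 12-volt peak gives roughly 0.22 millisec and a 25-volt peak roughly 0.11 millisec, in line with the interval stated in the text.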

Variation in the gating pulse width is not serious and has 
little effect on the accuracy of the B-H Meter. A change in 
starting time for the φ_r gate or stopping time for the H_c gate 
is much more serious. In the case of the φ_r gate, it is operated 
when the H voltage is passing through zero in the negative 
direction. If this voltage is increased, the starting time for 
the negative change in the clipped wave is delayed because 
of the steeper wave front. On the other hand, in the case 
of a steeper wave front, the output voltage from the 
differentiator is increased, since the output voltage is proportional 
to the ratio of the time constant of the network to the time 
interval over which the voltage is changing. The clamp 
requires about 4 volts to operate it, and the two conditions 
above help to cancel each other. 
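
The dependence of the differentiator output on wave-front steepness can be sketched as follows. The example assumes an ideal, unloaded RC differentiator with a 25-microsec time constant driven by a linear ramp of 1 volt total swing (the clipped portion of the wave), and gives the voltage ahead of the amplifier that follows. The ramp response k·τ·(1 − exp(−t/τ)) is the standard result for such a network; the function name is illustrative.

```python
import math

TAU_S = 25e-6   # differentiator time constant, 25 microsec


def differentiator_peak_volts(swing_volts, transition_s, tau_s=TAU_S):
    """Output of an ideal RC differentiator at the end of a linear ramp."""
    slope = swing_volts / transition_s   # volts per second
    return slope * tau_s * (1.0 - math.exp(-transition_s / tau_s))


if __name__ == "__main__":
    for t_ms in (0.221, 0.106):
        v = differentiator_peak_volts(1.0, t_ms * 1e-3)
        print(f"transition {t_ms:.3f} millisec: {v:.3f} V before amplification")
```

Halving the transition time roughly doubles the output, since for transitions much longer than the time constant the result approaches the swing multiplied by the ratio of time constant to transition time.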

The time of measurement still requires some compensation 
because φ_r should be measured from the time when H 
is zero, and H_c should be measured before φ reaches zero. 
The method applied for this correction is shown in Fig. 5. 
In general, the gate for φ_r must be retarded and the gate 
for H_c must be advanced. A positive voltage Ey, applied to 
the clipper (Fig. 5) will, in effect, lower the reference line 
from 0 to 0'. One volt applied between the coupling capacitor 
and the limiting resistor will bring about a phase shift 
of several degrees. A combination of fixed and varying 
positive voltage is applied to both clippers, because the φ_r 
gate must be retarded and the H_c gate must be advanced. 
It is important that the coupling capacitors (each marked 
with an asterisk in Figs. 2 and 3) have a stable leakage 
resistance of 100 megohms or higher. A fluctuating leakage 
resistance can have a serious effect on the timing of the 
gating pulses. 
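
The size of the phase shift produced by a bias voltage can be estimated in the same way as the clipping interval. The sketch assumes a sinusoidal signal of 12 to 25 volts peak at the clipper and treats the bias simply as a displacement of the level at which the wave is taken to cross; the function names are illustrative.

```python
import math


def crossing_shift_degrees(bias_volts, peak_volts):
    """Angle by which the crossing moves when the reference level is offset."""
    return math.degrees(math.asin(bias_volts / peak_volts))


def crossing_shift_ms(bias_volts, peak_volts, freq_hz=60.0):
    """The same shift expressed as time at the given line frequency."""
    return crossing_shift_degrees(bias_volts, peak_volts) / 360.0 * 1000.0 / freq_hz


if __name__ == "__main__":
    for peak in (12.0, 25.0):
        deg = crossing_shift_degrees(1.0, peak)
        ms = crossing_shift_ms(1.0, peak)
        print(f"{peak:4.0f} V peak: {deg:.2f} degrees, {ms:.3f} millisec")
```

One volt of bias moves the crossing by roughly 2 to 5 degrees over this range of signal levels, which agrees with the "several degrees" stated above.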


PEAK-READING METERS 


The meters must be able to read peaks of very short 
duration. The time interval during which the clamp is open 
is about 0.25 millisec, and it is closed for about 16 millisec. 

Fig. 3. The φ amplifier. 

Fig. 4. Hysteresis loops. 

The peak-reading circuit, therefore, requires a very large 
time constant. An RC circuit having a time constant of 1 
sec was chosen. It discharges to 98.5% in 16 millisec and 
does not leave the meter action too sluggish. Only one vtvm 
is shown in the diagrams. 
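
The choice of time constant can be checked with the exponential discharge law. The sketch assumes a simple RC discharge during the interval in which the clamp is closed; the shorter time constant is included only for comparison and is not taken from the paper.

```python
import math


def fraction_remaining(hold_s, time_constant_s):
    """Fraction of the stored peak left after hold_s of RC discharge."""
    return math.exp(-hold_s / time_constant_s)


if __name__ == "__main__":
    for tau in (1.0, 0.1):
        pct = 100.0 * fraction_remaining(0.016, tau)
        print(f"time constant {tau:.1f} sec: {pct:.1f}% remains after 16 millisec")
```

With a 1-sec time constant the stored voltage falls by only about 1.5% between gate openings, whereas a time constant ten times shorter would lose roughly 15%.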


POWER REQUIREMENTS AND POWER SUPPLIES 


The ac current required for the exciting coil and the 
balance coil to achieve a field of 1000 oersteds is 4.18 amps. 
The real power is then only about 175 watts, since the re- 
sistance of the coils is about 10 ohms. The reactive power 
can be decreased if necessary. 
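
The real-power figure follows directly from the current and resistance quoted. The sketch below also evaluates the reactance of the 40-mfd series capacitor at 60 cps as a point of reference; how much of the coil reactance that capacitor offsets is not stated in the paper, and no coil inductance is assumed here. The function names are illustrative.

```python
import math


def real_power_watts(i_rms, resistance_ohms):
    """Power dissipated in the coil resistance."""
    return i_rms ** 2 * resistance_ohms


def capacitive_reactance_ohms(capacitance_farads, freq_hz=60.0):
    """Magnitude of the reactance of a capacitor at the given frequency."""
    return 1.0 / (2.0 * math.pi * freq_hz * capacitance_farads)


if __name__ == "__main__":
    print(f"real power: {real_power_watts(4.18, 10.0):.0f} W")
    print(f"40-mfd reactance at 60 cps: {capacitive_reactance_ohms(40e-6):.1f} ohms")
```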

The plate supplies are regulated and isolated as shown. 
The total current is about 70 ma for all tubes except the 
integrator and vtvm’s. The integrator requires a very well- 
regulated supply of about 25 ma for stable readings. 


OPERATION 


Figure 6 shows the location of the variable 
autotransformer, meters, and switches. 

Fig. 5. Gate generator. 

Fig. 6. The B-H meter. 


The B-H Meter is simple in its operation once it has been 
properly adjusted. The H_c meter may be calibrated directly 
by means of its own field-intensity meter and the variable 
5-K control in the vtvm. The φ_r meter may be calibrated 
with an external voltage source applied across the φ input. 
A speedier way of calibrating is by using a sample of 
magnetic material whose properties are known. 

The value of φ_m may be measured by turning a switch 
and reading the indication on the φ_r meter. The reading thus 
obtained is useful because the ratio of φ_r to φ_m indicates 
the squareness of the hysteresis loop. It should be noted 
that all values obtained are intrinsic, since the air flux is 
balanced out. For the residual induction, however, there 
is only one value, since the magnetizing force is zero. 

Fig. 7. The hysteresis-loop tracer. 


CONCLUSIONS 


The data produced by this instrument have been of great 
help both in our oxide research and in the production of 
magnetic iron oxide in large quantities. The instrument has 
also afforded a quick and accurate procedure for production 
personnel, giving them a measure of control which is almost 
unobtainable by other means. For research work alone, the 
oscilloscope, as used in our laboratory (Fig. 7), still has 
some advantages. The use of the meter-reading instrument 
is recommended particularly in production, where immediate 
answers must be secured by semiskilled production operators. 
The meter-reading instrument is also useful in some phases 
of research. 
ADMISSION REQUIREMENTS 


A MEMBER, according to the constitution, may be any 
person active in audio engineering who has an academic 
degree or its equivalent in scientific or professional ex- 
perience in audio engineering or in a closely related field 
or art. Such a person shall, upon election, be entitled to 
all the rights and privileges of the society. 

An ASSOCIATE MEMBER may be any person interested 
in the objectives of the Audio Engineering Society and, 
upon election, shall become entitled to all the rights and 
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office or chairmanship of standing committees. 
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audio engineering and enrolled in a recognized school, 
college, or university. A STUDENT MEMBER is not eligible 
to vote or to serve on committees except in his own local 
section. 

Dues, according to the following schedule, should be 
enclosed with the application: 
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* If date of application is October 1 to March 31, use Column B; 
THE PURPOSE OF THE SOCIETY 


The Audio Engineering Society is an organization rep- 
resenting the profession whose field is the recording, trans- 
mission, and reproduction of those frequencies audible to 
the human ear. Its objects are the advancement of the 
theory and practice of audio engineering and its closely 
related arts, and the dissemination of important informa- 
tion in this field to its membership. The Society seeks to 
continue the progress which has been made in audio en- 
gineering since the days of the original telephone and 
continuing through the early microphones, loudspeakers, 
and amplifying devices. Many of the electronic develop- 
ments familiar to us today, such as broadcasting, sound 
motion pictures, and radiotelephony, had their beginnings 
in audio engineering. 

Each year the Society gives the following awards: The 
Emile Berliner Award for an outstanding development in 
the field of audio engineering, the John H. Potts Memorial 
Award to the person who has made an outstanding 
contribution to the improvement of audio engineering, 
and the Audio Engineering Society Award to the person 
whose work has helped most the advancement of the 
Society in the past year. Honorary Memberships and 
Fellowships are awarded for past contributions to the 
profession. 


PUBLISHING 


Papers presented before conventions or meetings of the 
Society are often published in the Journal of the Audio 
Engineering Society, which is sent quarterly, without 
charge, to all members. Back issues of the Journal are 
available at reduced member prices. Occasional mailings 
are also made of news bulletins to members. 


MEETINGS AND SECTIONS 


The Audio Engineering Society holds conventions in 
the fall and winter of each year, during which technical 
papers are presented and exhibits made by manufacturers 
of equipment used in the audio field. No charge is made 
to members for admission to the convention. In addi- 
tion, local sections have been organized in various parts 
of the country, and abroad, which meet regularly for the 
presentation of technical papers and discussion of sub- 
jects of interest. Those members desiring to learn of 
sections in their vicinity or to form new sections should 
correspond with the Secretary. 


COMMITTEES 


The following committees are now active within the 
Society: Admissions; Awards; Constitution and By- 
Laws; Convention; Cooperative Research; Educational 
Standards; Employment Register; Finance; Historical; 
Lecture Course; Membership; Nominations; Papers Pro- 
curement; Publications; Public Relations; Sections; 
Standards. Qualified members wishing to serve on any of 
these committees should address the chairman of the 
committee involved, giving full details of their relevant 
experience. 


OFFICERS 


The Society annually elects a President, Executive 
Vice-President, Central Vice-President, Western Vice- 
President, Secretary, and Treasurer for one-year terms, 
and three members of the six-man Board of Governors 
for a two-year term. 
Information for Convention and/or JOURNAL Authors 


SUGGESTIONS FOR PREPARING MANUSCRIPTS, PHOTOGRAPHS, CHARTS, 
DRAWINGS, AND LANTERN SLIDES; LETTERING SIZES, MAILING 
INSTRUCTIONS, ETC. 


A Journal of original record. Papers originally 
published elsewhere or promised for 
publication elsewhere are not accepted for 
presentation at Audio Engineering Society 
Conventions or for publication in the 
JOURNAL OF THE AES. In rare instances, 
exceptions [...] Publications Committees at 
their discretion. 

All papers presented at AES Conventions 
are automatically considered for publication 
in the JOURNAL. 

Free and clear. It is assumed that all 
manuscripts submitted to the AES are offered 
free and clear. Any paper accepted for 
presentation before the Society or for 
publication in the JOURNAL OF THE AES becomes the 
exclusive property of the Audio Engineering 
Society. Complete publication rights are held 
by the AES for primary publication in the 
JOURNAL. 

Permission to reprint, in whole or in part, 
papers originally published in the JOURNAL 
OF THE AES is usually granted freely by the 
Publications Committee upon written request 
and provided the authors agree. 

Multiple bylines. If the paper is to have 
more than one author, the exact form of the 
[...] be indicated on [...] considered correct 
as given. 

Vital data. Make sure that the first page 
of your manuscript carries your name 
precisely as you would like it rendered in the 
event of publication. If your business, 
professional or academic affiliation is to be given, 
[...] your position bears a formal title, you 
may include it. 

Typing the manuscript. Only one side of 
the sheet should be used. Margins should be 
at least one inch wide on each side. Triple 
spacing is preferred. 

Reviewing of the manuscript is speeded up 
considerably if several copies are submitted. 
(This is helpful but in no sense mandatory.) 
The copies may be clear carbons, 
mimeographed sheets, copies made on a "spirit" 
duplicator, or blueprints. Review copies of 
diagrams, schematics, and graphs may be 
made by any convenient process. 

Abstract. The author should precede his 
text by an abstract summarizing the paper in 
general. The abstract may include a 
summary of observations and conclusions set 
forth. 

Subheads. Subheadings for important 
sections of the paper make it easier to read in 
printed form. 

References. References to periodical 
literature should include the author's full name, 
exact title of the article or paper cited, full 
name of the publication, volume number, 
page numbers, month, and year. Book 
references should include the author's full name, 
full title of the book, the specific page or 
pages referred to, the publisher, place of 
publication, and year of publication. References 
to patent literature should preferably be 
given as follows: name, number of the patent 
(U. S. or foreign), date of filing, and date of 
issue. A brief description of the patent is 
helpful. 

Mathematical symbols. Care should be 
taken to make all mathematical expressions 
clear to the printer. All Greek letters and 
unusual symbols should be identified in 
[...] should be typewritten: all others should be 
written in carefully in ink. Do not neglect 
[...] 

[...] These captions should be listed, in complete 
form and consecutively, on a single sheet of 
[...] 

[...] prepared carefully to insure good reproduction. 
Photos should be standard 8 in. x 10 in. glossy 
[...] be obscured in all reproduction processes, it 
is often advisable to include a separate 
picture, i.e., a "closeup," of any highly 
significant detail, in addition to the general view 
which describes the overall field. 

Drawings. The reproduction copies of 
sketches, of curves, or schematics (as 
distinguished from any review copies submitted) 
should preferably be original drawings in 
India ink on white paper or on tracing paper, 
8½ in. x 11 in. Curves made on conventional 
graph paper will reproduce poorly, but black 
India ink tracings, in which only the 
principal cross-section lines are rendered, are 
satisfactory. 

Sharp, high-contrast photographs of line 
drawings are acceptable. 

Please do not send black-and-white lantern 
slides or color transparencies for publication; 
only photographic prints or original 
drawings should be submitted. 

Lettering. On the aforementioned 8½ in. 
x 11 in. sheets, lettering and numerical data 
should not be less than 0.12 in. high. 
Necessary labeling should be lettered onto curves 
and sketches. On the other hand, if extensive 
descriptive material is needed, it is better to 
put such information into the typewritten 
captions rather than to attempt to letter the 
information onto the curve or sketch. 

Mailing. Mail the two copies of your 
manuscript as directed in the AES Convention 
story which appears on the inside front cover 
of this issue of the JOURNAL. 

Mail that copy of the manuscript which is 
accompanied by the reproduction copies of 
your illustrative material (the Editor's copy) 
flat, with plenty of stiff cardboard enclosed. 
It is advisable to mark the envelope 
"PLEASE DO NOT BEND." 


ORAL PRESENTATION OF THE PAPER 

Time allotment. The average time allowed 
for any paper, unless the author requests 
more time on his Author's Form, is 20 
minutes. If you are not going to deliver 
your paper in person, please supply the name 
of [...] Convention Chairman as early as 
possible. 

Special oral version. An informal version 
of approximately 20 minutes, having an air 
of spontaneity, is usually more effective than 
[...] length version of their paper for publication, 
prepare an informal version for their 
guidance during oral delivery. 

[...] in operating conditions. The Convention 
Committee will cooperate in every way within 
[...] advance of the particular session for which 
[...] 

Facilities. If you expect to need a 
blackboard or any special facilities (electric power, 
large table or tables, etc.) please notify the 
Convention Chairman as early as possible. 

Lantern slides. Photographs, diagrams, 
charts, and curves intended to accompany 
oral delivery should be in the form of 
standard American lantern slides (3¼ in. x 4 in.) 
[...] dimension of the projected area should be 
horizontal in the standard slide, but may be 
[...] 2 in. slide. 

PLEASE NOTE: Opaque projection of 
paper prints or drawings is not recommended. 
Nor can we guarantee that facilities for this 
type of projection will be provided. Because 
of the dimness of the image, opaque 
projection is unsatisfactory for large audiences. 

Lettering on lantern slides. Charts, 
diagrams and curves from which lantern slides 
are to be made should be held within the 
proportions or overall dimensions of 7 in. x 10 
in., with no more than 20, and preferably 
only 15 words to a chart. Vertical Gothic 
capitals are recommended. 

For a 7 in. x 10 in. working area, the 
smallest desirable letter height is 0.14 in. 
(made with a Leroy pen 0 or Wrico pen 7). 

Recommended, too, are the following line 
widths: 

For curves: 1½ to 2 points or 0.021 
in. to 0.028 in. 

For grid rules: ½ point or 0.007 in. 

For reference lines: 1 point or 0.014 
in. 

Thumb marks. To indicate proper 
orientation, a thumb mark should be placed in the 
lower left-hand corner of the slide, when the 
latter is viewed as it will appear projected on 
the screen. 

More complete information on lantern slide 
dimensions is given in American Standard 
Dimensions for Lantern Slides, Z38.7.19-1950. 

SUSTAINING MEMBERS 


Grateful thanks of the Audio Engineering Society are hereby extended 
to the following organizations which, as sustaining members of the Society, 
help make this publication possible. These organizations are: 


Altec Lansing Corporation 
Ampex Corporation 
Audio Devices, Inc. 
Audio 
Bell Sound Systems, Inc. 
Berlant Concertone, Audio Division 
of American Electronics, Inc. 
David Bogen Co., Inc. 
British Industries Corporation 
Capitol Records, Inc. 
Columbia Records, Inc. 
Components Corporation 
The Daven Company 
Gotham Recording Corporation 
Harvey Radio Company, Inc. 
High Fidelity, Audiocraft 
James B. Lansing Sound, Inc. 
McIntosh Laboratory, Inc. 
Oberline, Inc. 
Measurements Corporation 
Permoflux Corporation 
Pickering & Company, Inc. 
Presto Recording Corporation 
Reeves Sound Studios, Inc. 
Reeves Soundcraft Corporation 
Regency Division of I.D.E.A., Inc. 
Rek-O-Kut Company 
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